BU-HEPP-08-20 



A Simple Introduction to Particle Physics 
Part I - Foundations and the Standard Model 

Matthew B. Robinson, f] Karen R. Bland, 
Gerald B. Cleaver,]^ and Jay R. Dittmann|_ 

Department of Physics, One Bear Place # 97316 
Baylor University 
Waco, TX 76798-7316 

Abstract 

This is the first of a series of papers in which we present a brief introduction to the rele- 
vant mathematical and physical ideas that form the foundation of Particle Physics, in- 
cluding Group Theory, Relativistic Quantum Mechanics, Quantum Field Theory and 
Interactions, Abelian and Non-Abelian Gauge Theory, and the SU{3) ® SU{2) [/(I) 
Gauge Theory that describes our universe apart from gravity. Our approach, at first, is 
an algebraic exposition of Gauge Theory and how the physics of our universe comes 
out of Gauge Theory. 

With an algebraic understanding of Gauge Theory and the relevant physics of the 
Standard Model from this paper, in a subsequent paper we will "back up" and refor- 
mulate Gauge Theory from a geometric foundation, showing how it connects to the 
algebraic picture initially built in these notes. 

Finally, we will introduce the basic ideas of String Theory, showing both the geometric 
and algebraic correspondence with Gauge Theory as outlined in the first two parts. 

These notes are not intended to be a comprehensive introduction to any of the ideas 
contained in them. Their purpose is to introduce the "forest" rather than the "trees". 
The primary emphasis is on the algebraic/ geometric/ mathematical underpinnings 
rather than the calculational/phenomenological details. Among the glaring omis- 
sions are CPT theorems, evaluations of Feynman Diagrams, Renormalization, and 
Anomalies. The topics were chosen according to the authors' preferences and agenda. 

These notes are intended for a student who has completed the standard undergrad- 
uate physics and mathematics courses. The material in the first part is intended as a 
review and is therefore cursory. Furthermore, these notes should not and will not in 
any way take the place of the related courses, but rather provide a primer for detailed 
courses in QFT, Gauge Theory, String Theory, etc., which will fill in the many gaps left 
by this paper. 
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1 Part I — Preliminary Concepts 
1.1 Review of Classical Physics 
1.1.1 Hamilton's Principle 

Nearly all physics begins with what is called a Lagrangian for a particle, which is initially 
dejfined as the kinetic energy minus the potential energy 

L = T-V 

where T — T{q,q) and V — V{q). Then, the Action is defined as the integral of the 
Lagrangian from an initial time to a final time. 



It is important to realize that 5 is a "functional" of the particle's world-line in {q, q) space, 
not a function. This means that it depends on the entire path (g, q), rather than a given 
point on the path. The only fixed points on the path are q{ti) , q{t f) , q{ti) , and q{tf). The 
rest of the path is generally unconstrained, and the value of S depends on the entire path. 

Hamilton's Principle says that nature extremizes the path a particle will take in going 
from q{ti) at time tj to position q{tf) at time tf. In other words, the path that extremizes 
the action will be the path the particle will travel. 

But, because is a functional, depending on the entire path in (g, q) space rather than a 
point, it cannot be extremized in the "Calculus 1" sense of merely setting the derivative 
equal to 0. Instead, we must find the path for which the action is "stationary". This means 
that the first-order term in the Taylor Expansion around that path will vanish, ox 5S — Q 
at that path. 

To find this, consider some arbitrary path (g, q). If it is a path that minimizes the action, 
then we will have 



Integrating the second term by parts, and taking the variation of 5g to be at at U and tf, 





dtL{q -\- Sq,q-\- Sq) — S 





dL _ d^dL 
dq dt dq 



= 



5 



The only way to guarantee this for an arbitrary variation 5q from the path (g, q) is to 
demand 

dOL _dL _^ 
dt dq dq 

This equation is called the Euler-Lagrange equation, and it produces the equations of 
motion of the particle. 

The generalization to multiple coordinates qi{i = 1, . . . , n) is straightforward: 

dt oqi oqi 



1.1.2 Noether's Theorem 



Given a Lagrangian L = L{q, q), consider making an infinitesimal transformation 

q ^ q + eSq 

where e is some infinitesimal constant. This transformation will give 

dL dL 

L{q, q) L{q + e5q, q + e5q) = L{q, q) + e(5g— + 

If the Euler-Lagrange equations of motion are satisfied, so that — then under 

q^q + eSq, 

BL ^.dL ^ ddL dL d ^ ^ dfdL' 
L^ L + edq— + edq— = L + eSq—— + e——dq = L + —[ jrredq 
oq dq dt dq dq dt dt \ dq 

So, under q ^ q -\- e5q, we have 5L — . We define the Noether Current, j, as 

Now, if we can find some transformation 5q that leaves the action invariant, or in other 
words such that 5S = then ^ = 0, and therefore the current j is a constant in time. In 
other words, j is conserved. 

As a familiar example, consider a projectile, described by the Lagrangian 

L = ]^m{x^ + y^) - mgy (1.2) 

This will be unchanged under the transformation a; — > a; + e, where e is any constant 
(here, 5q — Im. the above notation), because x^x-\-e^x^x. So, j = ^5q ~ mx is 
conserved. We recognize mx as the momentum in the x-direction, which we expect to be 
conserved by conservation of momentum. 

So in summary, Noether's Theorem merely says that whenever there is a continuous 
symmetry in the action, there is a corresponding conserved quantity. 
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1.1.3 Conservation of Energy 



Consider the quantity 



dL d dL dq dL dq dL 

~dt^Jt ^^''^^ ^ ~d^di^ ~d^dt^ ~dt 



Because L does not depend explicitly on time, §f = 0, and therefore 

dL_dL. dL.._fddL\. dL _ d f dL \ 
dt dq^ dq^ \dt dq dq^ dt\ dq ^/ 



where we have used the Euler-Lagrange equation to get the second equality. So, we have 

dL d^f r>r 

dt dtKdq'^)'^^ 



"^^-,-^1=0 (1.3) 



dt \ dq 



For a general non-relativistic system, L = T — V,so^ = ^ because V is a function of q 
only, and normally 

T oc g2 ^ ^Q = '^T 
dq 

So, ^q — L = 2T—{T — V) = T+V = E, the total energy of the system, which is conserved 
according to (1.3). We identify T + V = H as the Hamiltonian, or total energy function, 
of the system. 

Furthermore, we define || = p to be the momentum of the system. Then, the relationship 
between the Lagrangian and the Hamiltonian is the Legendre transformation 

pq — L = H 



1.1.4 Lorentz Transformations 



Consider some event that occurs at spatial position (x, y, z)'^, at time t. (The superscript 
T denotes the transpose, so this is a column vector.) We arrange this event in a column 
4-vector as (ct, x, y, zY , where c is the speed of light (the units of c give each element the 
same units). A more useful notation is to refer to this vector as = {ct, x, y, z)'^ , where 
/i = 0, 1, 2, 3. This 4-vector, with the ji index raised, is called a "vector", or a "contravariant 
vector". Then, we define the row vector = {—ct, x, y, z). This is called a "covector", or 
a "covariant vector". In general, the sign of the 0*'' component (the component in the first 
position) changes when going from vector to covector. 
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There is something very deep going on here regarding the geometrical picture between 
vectors and covectors, but we will not discuss it until the next paper in this series. 



The dot product between two such vectors (a covector and vector) is then defined as the 
product with one index raised and the other lowered. Whenever indices are contracted 
in such a way, it is understood that they are to be summed overj^ 

a-b = = a%o + a^bi + a%2 + a^h = -a%° + a^b^ + a^"^ + 
Or, plugging in the spacetime notation from above, where 

a'' = {cti,xi,yi,zi)^ and = {ct2,X2,y2, zz)'^ 

we have 

a-h = aj)^ = -c^tit2 + X1X2 + 2/1I/2 + Z1Z2 



We can also discuss the differential version of this. If = {ct, x, y, z), then ds^ = —c^dt^ + 
dx"^ + dy"^ + dz"^. 

In his theory of Special Relativity, Einstein postulated that all inertial reference frames 
are equivalent, and that the speed of light is the same in all frames. To put this in more 
mathematical terms, if observers in different inertial frames 1 and 2 each see an event, 
they will see, respectively, 

ds\ = —(?dt\ + dx\ + dy\ + dz\ 
ds\ = —(?dt\ + dx\ + dyl + dz2 

We then demand that ds\ = ds^. To do this, we must find a modification of the standard 
Galilean transformations that will leave ds'^ unchanged. The derivation for the correct 
transformations can be found in any introductory or modern physics text, so we merely 
quote the result. If we assume that frame 2 is moving only in the z-direction with respect 
to frame 1 (and that their x, y, and z axes are aligned), then we find that the transforma- 
tions are 

t2 = 7(cti - Pzi) 

Z2 = ^{zi-l3cti) (1.4) 

where /? = f and 7 = y=^- These transformations, which preserve ds^ when transform- 
ing one frame to another, are called Lorentz Transformations. 

Discussions of the implications of these transformations, including time dilation, length 
contraction, and the relationship between energy and mass can be found in most intro- 
ductory texts. You are encouraged to review the material if you are not familiar with 
it. 

^because we are summing over components, we can write a^^h^ or a^h^ — they mean the same thing 
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1.1.5 A More Detailed Look at Lorentz Transformations 



As we have seen, we have a quantity ds^ = —c^dt^ + dx^ + dy^ + dz^, which does not 



change under transformations (1.4 1. Thinking of physical ideas this way, in terms of 
"what doesn't change when something else changes", will prove to be an extraordinarily 
powerful approach. In order to understand Special Relativity in such a way, we begin 
with a simpler example. 

Consider a spatial rotation around, say, the ^-axis (or, equivalently, mixing the x and y 
coordinates). Such a transformation is called an Euler Transformation, and takes the 
form 

t' = t 

x' = X cos 6 + y sin 9 
y' = —X sin 9 + y cos 9 

z = z (1.5) 

where 9 is the angle of rotation, called the Euler Angle. We can simultaneously express a 
Lorentz transformation as a sort of "rotation" that mixes a spatial dimension and a time 



dimension, as follows (these transformations are equivalent to (1.4): 



t' = t cosh 9 — X sinh 9 
x' = —t sinh 9 + X cosh 9 

y' = y 

z' = z (1.6) 
where 9 is defined by the relationship P = tan 9. 

We denote a transformation mixing two spatial dimensions simply a Rotation, whereas 
a transformation mixing a spatial dimension and a time dimension is a Boost. Any two 
frames whose origins coincide at t = t' = can be transformed into each other through 
some combination of rotations and boosts. 



To rephrase this in more precise language, given a 4-vector x'', it will be related to the 
equivalent 4-vector in another frame, x'^, by some matrix L, according to x'^ = L^x" 
(where the summation convention discussed earlier is in effect for the repeated index). 

We also introduce what is called the Metric matrix, 

0\ 

1 
1 
1/ 

In general, r|^"' = {r]^^)~'^. 



flU 



/-I 
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Using the metric, the dot product of any 4-vector = {ct, x, y, zY can be easily written 
as x"^ = x^^x^ = ri^yX^x" = —c^t^ + a;^ + + z^. In general, a Lorentz transformation can be 
defined as a matrix (including boosts and rotations) that leaves rf^yX^x" unchanged. 

For example, a scalar, or an object with no uncontracted indices, like or x^x^„ is simply 
invariant under Lorentz transformations (0 cj), x^x^ — > x^x^). 

A vector, or an object with only one uncontracted index, like x^ or a^b'^, transforms ac- 
cording to x'^" = Li^x", or (a^6p' = L^(a^6p. 

Now, consider the dot product = x^x^ = rf^yX^x" . If is invariant, then x'^ = x^ ^ 
r]^ux'^x'^ = rj^^L'^L'^^x^x^ demands that r^^^L^L^ = rja/s. So, the constraint for Lorentz 
transformations is that they are the set of all matrices such that 

We take this to be the defining constraint for a Lorentz transformation. 



1.1.6 Classical Fields 



When deriving the Euler-Lagrange equations, we started with an action S which was an 
integral over time only {S = J dtL). If we are eventually interested in a relativistically 
acceptable theory, this is obviously no good because it treats time and space differently 
(the action is an integral over time but not over space). 

So, let's consider an action defined not in terms of the Lagrangian, but of the "Lagrangian 
per unit volume", or the Lagrangian Density C. The Lagrangian will naturally be the 
integral of C over all space, L = J d"xC. The integral is in n-dimensions, so d'^x means 

dx^dx'^dx'^ ■ ■ ■ dx"'. 

Now, the action will be = / dtL = J (it(i"x£. In the normal 1+3 dimensional Minkowski 
spacetime we live in, this will be S* = / dtd^xC = J d^xC. 

Before, L depended not on t, but on the path q(t), q{t). In a similar sense, C will not 
depend on x and t, but on what we will refer to as Fields, 0(x, t) = 0(x^), which exist in 
spacetime. 

Following a nearly identical argument as the one leading to ( 1.1[ >, we get the relativistic 
field generalization 



dC \ dC 



for multiple fields 0, (z = 1, . . . , ra). 
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Noether 's Theorem says that, for cj) ^ (f) + e5(j), we have a current j'^ = Q^lrfjS(l>> and if </> ^ 

(j) + e6(f) leaves 6C = 0, then d^j'^ = 0^— ^ + V-j = 0, where j° is the Charge Density, 
and j is the Current Density. The total charge will naturally be Q = ^^^^^ d^xj'^. 

Finally, we also have a Hamiltonian Density and momentum 

dC ■ 

n ^ -wr^,-^ (1-7) 
n'^ = ^ (1.8) 



One final comment for this section. For the remainder of these notes, we will ultimately 
be seeking a relativistic field theory, and therefore we will never make use of Lagrangians. 
We will always use Lagrangian densities. We will always use the notation C instead of L, 
but we will refer to the Lagrangian densities simply as Lagrangians. We drop the word 
"densities" for brevity, and because there will never be ambiguity. 



1.1.7 Classical Electrodynamics 



We choose our units so that c = /io = eo = 1- So, the magnitude of the force between two 
charges qi and q2is F = In these units. Maxwell's equations are 

V-E = p (1.9) 

VxB-— = J (1.10) 
ot 

V-B = (1.11) 

VxE+— = (1.12) 



If we define the Potential 4-vector A'^ = (0, A), then we can define B = V x A and 
E = — V0 — Writing B and E this way will automatically solve the homogenous 
Maxwell equations, ( 1.11[ > and (1.12 1. 

Then, we define the totally antisymmetric Electromagnetic Field Strength Tensor F^'^ as 



/o 

Ex 

Ey 

\E, 



-Ex 


-Ey 


-eA 







By 







—Bx 


-By 


Bx 


J 



We define the 4-vector current as = (p, J). It is straightforward, though tedious, to 
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show that 



V-i? = and VxE + — = 

ot 



V ■ E = p and V x 5 
1.1.8 Classical Electrodynamics Lagrangian 



dE 
~dt 



J 



Bringing together the ideas of the previous sections, we now want to construct a La- 
grangian density C which will, via Hamilton's Principle, produce Maxwell's equations. 

First, we know that C must be a scalar (no uncontracted indices). From our intuition with 
"Physics 1" type Lagrangians, we know that kinetic terms are quadratic in the derivatives 
of the fundamental coordinates (i.e. ^mx^ = ^'rn{^) ■ (^)). The natural choice is to take 
as the fundamental field. It turns out that the correct choice is 



^EM — —^Ffj_yF^^ — J^A^ 
(note that the term is quadratic in d^A^). So, 



S 



d X 



4 



F^^F^- - rA^ 



(1.13) 



(1.14) 



Taking the variation of (1.14) with respect to A^, 

SS = 



d^x 


- ^F.JF^'' - 


\5F,,F^^ - 


- rsA^ 


d^x 


- ^F^JFi^' - 


J^5A^ 






d'^x 


- lF,^id>^SA-^ 


- d^'SA^') - 




d^'x 


- F^,d^5A' - 


J^5A^ 







Integrating the first term by parts, and choosing boundary conditions so that 5 A vanishes 
at the boundaries. 



d'^x 
d^x 



d^F^^SA, - r6A, 



df.F^" - r 



So, to have 5S = 0, we must have d^F^^ = J" , and if this is written out one component 
at a time, it will give exactly the inhomogenous Maxwell equations ( 1.9[ > and ( 1.10[ >. And 
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as we already pointed out, the homogenous Maxwell equations become identities when 
written in terms of A^. 



As a brief note, the way we have chosen to write equation ( 1.13 ), in terms of a "potential" 
Afj,, and the somewhat mysterious antisymmetric "field strength" F^jj,, is indicative of an 
extremely deep and very general mathematical structure that goes well beyond classical 
electrodynamics. We will see this structure unfold as we proceed through these notes. 
We just want to mention now that this is not merely a clever way of writing electric and 
magnetic fields, but a specific example of a general theory. 



1.1.9 Gauge Transformations 

Gauge Transformations are usually discussed toward the end of an undergraduate 
course on E&M. Students are typically told that they are extremely important, but the 
reason why is not obvious. We will briefly introduce them here, and while their signifi- 
cance may still not be transparent, we will return to them several times throughout these 
notes. 



Given some specific potential A^, we can find the field strength action as in (1.141. How- 
ever, A^ does not uniquely specify the action. We can take any arbitrary function xi^'^)' 
and the action will be invariant under the transformation 

A'^ = A^' + d^x (1-15) 



or 



A>^^A"^ = {<P-^,A + Vx) 



Under this transformation, we have 



p'l^u ^ d''A"'-d''A''' = d''{A'' + d''x)-d''{A'' + d''x) 
= d^A" - d^A" + d^d^x - d^d^'x 

= F^^ (1.16) 



So, F'^"' = F'"'. 



Furthermore, J^A^ J^A^ + J^d^x- Integrating the second term by parts with the usual 
boundary conditions. 



x{d^r)x 



But, according to Maxwell's equations, d^J^ = d^d„F^'-'^ = because F^^" is totally anti- 



symmetric. So, both F^'^ and J^d^x invariant under (1.15), and therefore the action of 
S is invariant under ( 1.15| >. 
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While the importance of gauge transformations may not be obvious at this point, it will 
become perhaps the most important idea in particle physics. As a note before moving on, 
recall previously when we mentioned the idea of "what doesn't change when something 
else changes" when talking about Lorentz transformations. A gauge transformation is 
exactly this (in a different context): the fundamental fields are changed by x> but the 
equations which govern the physics are unchanged. 

In the next section, we provide the mathematical tools to understand why this idea is so 
important. 



1.2 References and Further Reading 



The material in this section can be found in nearly any introductory text on Classical 
Mechanics, Classical Electrodynamics, and Relativity. The primary sources for these notes 
are 0, ma, and lH. 



For further reading, we recommend 0, CSl, dl, 1123/ H, and El- 
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2 Part II — Algebraic Foundations 



2.1 Introduction to Group Theory 



There are several symbols in this section which may not be familiar. We therefore provide 
a summary of them for reference. 



N={0,1,2,3,...} 

Z = {0,±1,±2,±3,...} 

Q = Rational Numbers 

M = Real Numbers 

C = Complex Numbers 

Z„ = Z mod n 



^ is read "implies" 
iff is read "if and only if 
V is read "for every" 
3 is read "there exists" 

G is read "in" 

3 is read "such that" 



= is "represented by" 
C is "subset of 
= is "defined as" 



Now that we have reviewed the primary ideas of classical physics, we are almost ready to 
start talking about particle physics. However, there is a bit of mathematical "machinery" 
we will need first. Namely, Group Theory. 

Group theory is, in short, the mathematics of symmetry. We are going to begin talking 
about what will seem to be extremely abstract ideas, but eventually we will explain how 
those ideas relate to physics. As a preface of what is to come, the most foundational idea 
here is, as we said before, "what doesn't change when something else changes". A group 
is a precise and well-defined way of specifying the thing or things that change. 



2.1.1 What is a Group? 

To begin with, we define the notion of a Group. This definition may seem cryptic, but it 
will be explained in the paragraphs that follow. 

A group, denoted {G, is a set of objects, denoted G, and some operation on those objects, de- 
noted -k, subject to the following: 

1. y gi, 92 & G, giic g2 e G also, (closure) 

2- V gi, g2, gs G G, it must be true that {gi ★ g2) ^ga = gi^ (fl'2 ^ fl's)- (associativity) 

3. 3g G G, denoted e, 3 \/gi EG, e-k gi = gi-k e = gi. (identity) 

4. ^g e G,3h e G 3 hk g = g k h = e, (so h = g~^). (inverse) 

Now we explain what this means. By "objects" we literally mean anything. We could be 
talking about Z or R, or we could be talking about a set of Easter eggs all painted different 
colors. 
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The meaning of "some operation", which we are calling i^, can literally be anything you 
can do to those objects. A formal definition of what ★ means could be given, but it will be 
easier to understand with examples. 

Note: The definition of a group doesn't demand that gi -k gj = g.j ★ gi. This is a very 
important point, but we will discuss it in more detail later. We mention it now so it is not 
new later. 

Example 1: (G, = (Z, +) 

Consider the set G to be Z, and the operation to be * = +, or simply addition. 

We first check closure. If you take any two elements of Z and add them to- 
gether, is the result in Z? In other words, if a, 6 e Z, is a + 6 e Z? Obviously 
the answer is yes; the sum of two integers is an integer, so closure is met. 

Now we check associativity. If a, 6, c e Z, it is trivially true that a+ {b + c) — 
{a + b) + c. So, associativity is met. 

Now we check identity. Is there an element e G Z such that when you add e 
to any other integer, you get that same integer? Clearly the integer satisfies 
this. So, identity is met. 

Finally, is there an inverse? For any integer a G Z, will there be another integer 
6 G Z such that a + b = e = 0? Again, this is obvious, a^^ = —a in this case. So, 
inverse is met. 

So, (G, -k) = (Z, +) is a group. 

Example 2: {G, ^) = (M, +) 

Obviously, any two real numbers added together is also a real number. 
Associativity will hold (of course). 
The identity is again 0. 

And finally, once again, —a will be the inverse of any a G R. 

Example 3: {G, = (R, •) (multiplication) 

Closer is met; two real numbers multiplied together give a real number. 
Associativity obviously holds. 

Identity also holds. Any real number a G M, when multipled by 1 is a. 

Inverse, on the other hand, is trickier. For any real number, is there another 
real number you can multiply by it to get 1? The instinctive choice is = K 
But, this doesn't quite work because of a = 0. This is the only exception, but 
because there's an exception, (R, •) is not a group. 

Note: If we take the set R - {0} instead of R, then (R - {0}, •) is a group. 



16 



Example 4: (G,*) = ({1}, •) 



This is the set with only the element 1, and the operation is normal multiplica- 
tion. This is indeed a group, but it is extremely uninteresting, and is called the 
Trivial Group. 

Example 5: {G, ★) = (Zg, +) 

This is the set of integers mod 3, containing only the elements 0, 1, and 2 
(3 mod 3 is 0, 4 mod 3 is 1, 5 mod 3 is 2, etc.) 

You can check yourself that this is a group. 
2,1,2 Finite Discrete Groups and Their Organization 

From the examples above, several things should be apparent about groups. One is that 
there can be any number of objects in a group. We have a special name for the number of 
objects in the group's set. The Order of a group is the number of elements in it. 

The order of (Z, +) is infinite (there are an infinite number of integers), as is the order of 
(M, +) and (M - {0}, •). But, the order of ({1}, ■) is 1, and the order of (Z3, +) is 3. 

If the order of a group is finite, the group is said to be Finite. Otherwise it is Infinite. 

It is also clear that the elements of groups may be Discrete, or they may be Continuous. 

For example, (Z, +), ({!}, •), and (Z3, +) are all discrete, while (R, +) and (R — {0}, •) are 
both continuous. 

Now that we understand what a discrete finite group is, we can talk about how to orga- 
nize one. Namely, we use what is called a Multiplication Table. A multiplication table is 
a way of organizing the elements of a group as follows: 





e 


9i 


92 




e 


e-k e 


e-kgi 


e-kg2 




91 


gi-ke 


91-^91 


91-^92 




92 


g2-ke 


92-^91 


92-^92 















We state the following property of multiplication tables without proof. A multiplication 
table must contain every element of the group exactly one time in every row and every column. A 
few minutes thought should convince you that this is necessary to ensure that the defini- 
tion of a group is satisfied. 
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As an example, we will draw a multiplication table for the group of order 2. We won't 
look at specific numbers, but rather call the elements gi and g2. We begin as follows: 





e 


9i 


e 


7 


7 


9i 


7 


7 



Three of these are easy to fill in from the identity: 





e 


9i 


e 


e 


9i 


91 


91 


7 



And because we know that every element must appear exactly once, the final question 
mark must be e. So, there is only one possible group of order 2. 

We will consider a few more examples, but we stress at this point that the temptation to 
plug in numbers should be avoided. Groups are abstract things, and you should try to 
think of them in terms of the abstract properties, not in terms of actual numbers. 

We can proceed with the multiplication table for the group of order 3. You will find that, 
once again, there is only one option. (Doing this is instructive and it would be helpful to 
work this out yourself.) 





e 


91 


92 


e 


e 


9i 


92 


9i 


9i 


92 


e 


92 


92 


e 


9i 



You are encouraged to work out the possibilities for groups of order 4. (Hint: there are 4 
possibilities.) 

2.1.3 Group Actions 

So far we have only considered elements of groups and how they relate to each other. The 
point has been that a particular group represents nothing more than a structure. There 
are a set of things, and they relate to each other in a particular way. Now, however, we 
want to consider an extremely simple version of how this relates to nature. 
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Example 6 



Consider three Easter eggs, all painted different colors (red, orange, and yel- 
low), which we denote R, O, and Y. Now, assume they have been put into a 
row in the order (ROY). If we want to keep them lined up, not take any eggs 
away, and not add any eggs, what we can we do to them? We can do any of 
the following: 

1. Let e be doing nothing to the set, so e{ROY) = (ROY). 

2. Let gi be a cyclic permutation of the three, gi{ROY) = (OYR) 

3. Let §2 be a cyclic permutation in the other direction, g2{R0Y) = (YRO) 

4. Let g^i be swapping the first and second, g2,{R0Y) = (ORY) 

5. Let g4 be swapping the first and third, gi{ROY) = (YOR) 

6. Let g^ be swapping the second and third, g5{R0Y) = (RYO) 

You will find that these 6 elements are closed, there is an identity, and each has 
an inverse]^ So, we can draw a multiplication table (you are strongly encour- 
aged to write at least part of this out on your own): 





e 


9i 


92 


93 


94 


95 


e 


e 


9i 


92 


93 


94 


95 


9i 


9i 


92 


e 


95 


93 


94 


92 


92 


e 


9i 


94 


95 


93 


93 


93 


94 


95 


e 


91 


92 


94 


94 


95 


93 


92 


e 


91 


95 


95 


93 


94 


91 


92 


e 



There is something interesting about this group. Notice that g^^gi = g^, whereas gi^g^ = 
g^. So, we have the surprising result that in this group it is not necessarily true that 

9i * 9j = 9j * 9i- 

This leads to a new way of classifying groups. We say a group is Abelian if gi -k gj = gj ^ir g^ 
^9i-,9j e G. If a group is not Abelian, it is Non- Abelian. 

Another term commonly used is Commute. If gi ^ gj = gj ★ gi, then we say that gi and 
gj commute. So, an Abelian group is Commutative, whereas a Non-Abelian group is 
Non-Commutative . 

^We should be very careful to draw a distinction between the elements of the group and the objects the 
group acts on. The objects in this example are the eggs, and the permutations are the results of the group 
action. Neither the eggs nor the permutations of the eggs are the elements of the group. The elements of 
the group are abstract objects which we are assigning to some operation on the eggs, resulting in a new 
permutation 
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The Easter egg group of order 6 above is an example of a very important type of group. It 
is denoted 5*3, and is called the Symmetric Group. It is the group that takes three objects 
to all permutations of those three objects. 

The more general group of this type is S'„, the group that takes n objects to all permu- 
tations of those objects. You can convince yourself that Sn will always have order n\ {n 
factorial). 

The idea above with the 3 eggs is that is the group, while the eggs are the objects that 
the group acts on. The particular way an element of S3 changes the eggs around is called 
the Group Action of that element. And each element of S3 will move the eggs around 
while leaving them lined up. This ties in to our overarching concept of "what doesn't 
change when something else changes". The fact that there are 3 eggs with 3 particular 
colors lined up doesn't change. The order they appear in does. 

2.1.4 Representations 

We suggested above that you think of groups as purely abstract things rather than trying 
to plug in actual numbers. Now, however, we want to talk about how to see groups, 
or the elements of groups, in terms of specific numbers. But, we will do this in a very 
systematic way. The name for a specific set of numbers or objects that form a group is 
a Representation. The remainder of this section (and the next) will primarily be about 
group representations. 

We already discussed a few simple representations when we discussed (Z, +), (M — {0}, •), 
and (Z3, +). Let's focus on (Z3, +) for a moment (the integers mod 3, where e = 0, gi = 1, 
g2 = 2, with addition). Notice that we could alternatively define e = 1, gi = , and g2 = 
e^, and let ★ be multiplication. So, in the "representation" with (0,1,2) and addition, we 
had for example 

^1 ★ g,2 = (1 + 2) mod 3 = 3 mod 3 = = e 
whereas now with the multiplicative representation we have 

5*1 * 5*2 = e 3 -63 = e = e = 1 = e 
So the structure of the group is preserved in both representations. 

We have two completely different representations of the same group. This idea of differ- 
ent ways of expressing the same group is of extreme importance, and we will be using it 
throughout the remainder of these notes. 

We now see a more rigorous way of coming up with representations of a particular group. 
We begin by introducing some notation. For a group (G,*) with elements gi, g2, . ■ ., we 
call the Representation of that group D{G), so that the elements of G axe D{e), D{gi), 
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D{g2) (where each D{gi) is a matrix of some dimension). We then choose ★ to be matrix 
multiplication. So, D{gi) ■ D{gj) = D{gi~k g^). 

It may not seem that we have done anything profound at this point, but we most defi- 
nitely have. Remember above that we encouraged seeing groups as abstract things, rather 
than in terms of specific numbers. This is because a group is fundamentally an abstract 
object. A group is not a specific set of numbers, but rather a set of abstract objects with a 
well-defined structure telling you how those elements relate to each other. 

And the beauty of a representation D is that, via normal matrix multiplication, we have 
a sort of "lens", made of familiar things (like numbers, matrices, or Easter eggs), through 
which we can see into this abstract world. And because D{gj) ■ D{gj) = D{gi ★ gj), we 
aren't losing any of the structure of the abstract group by using a representation. 

So now that we have some notation, we can develop a formalism to figure out exactly 
what D is for an arbitrary group. 

We will use Dirac vector notation, where the columjn vector 



V2 
\'-J 



\v) 



and the row vector 

So, the dot product between two vectors is 



V ■ u 



[vi V2 V3-- ■) 



U2 
U3 



ViUi + V2U2 + V3US H = {v\u) 



Now, we proceed by relating each element of a finite discrete group to one of the standard 
orthonormal unit vectors: 

e ^ |e) = |ei) gi \gi) = |e2) g2 ^ \g2) = jes) 

And we define the way an element in a representation D{G) acts on these vectors to be 

Di9i)\9j) = \9i-^9j) 



Now, we can build our representation. We will (from now on unless otherwise stated) 
represent the elements of a group G using matrices of various sizes, and the group opera- 
tion -k will be standard matrix multiplication. The specific matrices that represent a given 
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element gu of our group will be given by 

[D{gk)]ij = {g^\D{gu)\g,) (2.1) 

As an example, consider again the group of order 2 (we wrote out the multiplication table 
above on page[l8]). First, we find the matrix representation of the identity, [D{e)\ij, 

[D{e)]ii = {e\D{e)\e) = (e|e * e) = (e|e) = 1 

[D{e)]u = {e\D{e)\gi) = {e\e ^ gi) = {e\gi) = 

[Die)]2i = {gi\Die)\e) = {g,\e ^ e) = {g,\e) = 

[Die)]22 = {gi\D{e)\gi) = {gi\e-kgi) = (g^lgi) = 1 

So, the matrix representation of the identity is D{e) = shouldn't be surprising 

that the identity element is represented by the identity matrix. 

Next we find the representation of D{gi): 

[Dig,)]ii = {e\D{g,)\e) = {e\gi ^ e) = {e\gi) = 

[D{gi)]i2 = {e\D{gi)\gi) = {e\gi -k gi) = {e\e) = 1 

[D{gi)]2i = {gi\D{gi)\e) = {g^\g^^ e) = {gi\gi) = 1 

[D{gi)]22 = {gi\D{gi)\gi) = {g^lg^ -k g^) = {gi\e) = 

So, the matrix representation of gi is D{gi) = ^ ^ . It is straightforward to check that 
this is a true representation. 



ly ^0 ly ^0 1 



ij \i oj \1 



1 oy ^0 ly ^1 



Instead of considering the next obvious example, the group of order 3, consider the group 
^3 from above (the multiplication table is on page 19 1. The identity representation D{e) is 
easy — it is just the 6x6 identity matrix. We encourage you to work out the representation 
of D{gi) on your own, and check to see that it is 

/O 



1 









1 











1 



(2.2) 
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All 6 matrices can be found this way, and multiplying them out will confirm that they do 
indeed satisfy the group structure of S^. 



2.1.5 Reducibility and Irreducibility — A Preview 



You have probably noticed that equation ( 2.1[ > will always produce a set of n x n ma- 



trices, where n is the order of the group. There is actually a name for this particular 
representation. The n x n matrix representation of a group of order n is called the Regular 
Representation. More generally, them x m matrix representation of a group (of any order) is 
called the m-dimensional representation. 

But, as we have seen, there is more than one representation for a given group (in fact, 
there are an infinite number of representations). 

One thing we can immediately see is that any group that is Non-Abelian cannot have a 
1x1 matrix representation. This is because scalars (1x1 matrices) always commute, 
whereas matrices in general do not. 



We saw above in equation (2.2 1 that we can represent the group Sn by n\ x n\ matrices. 
Or, more generally, we can represent any group using m x m matrices, were m equals 
order(G). This is the regular representation. But it turns out that it is usually possible to 
find representations that are "smaller" than the regular representation. 

To pursue how this might be done, note that we are working with matrix representations 
of groups. In other words, we are representing groups in linear spaces. We will therefore 
be using a great deal of linear algebra to find smaller representations. This process, of 
finding a smaller representation, is called Reducing a representation. Given an arbitrary 
representation of some group, the first question that must be asked is "is there a smaller 
representation?" If the answer is yes, then the representation is said to be Reducible. If 
the answer is no, then it is Irreducible. 

Before we dive into the more rigorous approach to reducibility and irreducibility, let's 
consider a more intuitive example, using S3. In fact, we'll stick with our three painted 
Easter eggs, R, O, and Y: 



1. e{ROY) = (ROY) 

2. gi{ROY) = (OYR) 

3. g2{R0Y) = (YRO) 

4. g^iROY) = (ORY) 

5. gi{ROY) = iYOR) 

6. g,{ROY) = (RYO) 
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We will represent the set of eggs by a column vector \E) = | O | . 

Now, by inspection, what matrix would do to \E) what gi does to {R0Y)1 In other words, 
how can we fill in the ?'s in 




to make the equality hold? A few moments thought will show that the appropriate matrix 
is 

^0 1 0\ (R^ 
10 

.1 o; v^, 

Continuing this reasoning, we can see that the rest of the matrices are 
D{e) = |o 1 j , D{g{) = | 1 | , D{g2) 



Digs) =1 , D{g,) = 1 , D{g, 








You can do the matrix multiplication to convince yourself that this is in fact a representa- 
tion of 5*3. 



So, in equation (2.2), we had a 6 x 6 matrix representation. Here, we have a new repre- 
sentation of consisting of 3 x 3 matrices. We have therefore "reduced" the representation. 
In the next section, we will look at more mathematically rigorous ways of reducing rep- 
resentations. 



2.1.6 Algebraic Definitions 

Before moving on, we must spend this section learning the definitions of several terms 
which are used in group theory. 

If H is a subset ofG, denoted H c G, such that the elements of H form a group, then we say that 
H forms a Subgroup ofG. We make this more precise with examples. 
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Example 7 



Consider (as usual) the group S3, with the elements labeled as before: 

1. go{ROY) = (ROY) 

2. gi{ROY) = (OYR) 

3. g2{R0Y) = (YRO) 

4. g^iROY) = (ORY) 

5. gi{ROY) = {YOR) 

6. gr^ROY) = (RYO) 

(where we are relabeling g^ = e for later convenience). The multiplication 



table is given on page 19 



Notice that {go,gi,g2} form a subgroup. You can see this by noticing that the 
upper left 9 boxes in the multiplication table (the go,gi, g2 rows and columns) 
all have only ^fo's, gi's, and g^s. So, here is a group of order 3 contained in S3. 



Example 8 

Consider the subset of 53 consisting of go and 5^3 only. Both g^ and g^ are their 
own inverses, so the identity exists, and the group is closed. Therefore, we can 
say that {go,g3} C is a subgroup of 6*3. 

In fact, if you write out the multiplication table for go and g^ only, you will 
see that it is exactly equivalent to the group of order 2 considered above. This 
means that we can say that S3 contains the group of order 2 (and we know from 
last time that there is only one such group, though there are an infinite number 
of representations of it). The way we understand this is that the abstract entity 
5*3, of which there is only one, contains the group of order 2, of which there 
is only one. However, the representations of S3, of which there are an infinite 
number, will each contain the group of order 2 (of which there are also an 
infinite number of representations). 



Example 9 

Notice that the sets {5^0, fl's}/ {go, g^}, arid {go, ^75} (all C 5*3), are all the same as 
the group of order 2. This means that 5*3 actually contains exactly three copies 
of the group of order 2 in addition to the single copy of the group of order 3. 

Again, this is speaking in terms of the abstract entity S3. We can see this 
through the "lens" of representation by the fact that any representation of S3 
will contain three different copies of the group of order 2. 
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Example 10 

As a final example of subgroups, there are two subgroups of any group, no 
matter what the group. One is the subgroup consisting of only the identity, 
{go} C G. All groups contain this, but it is never very interesting. 

Secondly, VG, G C G, and therefore G is always a subgroup of itself. We call 
these subgroups the "trivial" subgroups. 

We now introduce another important definition. 
If G is a group, and H is a subgroup ofG(Hc G), then 

• The set gH — {gi(h\h e H} is called the Left Coset ofH in G 

• The set Hg = {/i ★ g\h e H} is called the Right Coset ofH in G 

There is a right (or left) coset for each element g E G, though they are not necessarily all 
unique. This definition should be understood as follows; a coset is a set consisting of the 
elements of H all multiplied on the right (or left) by some element of G. 

Example 11 

For the subgroup H = {go, gi} <Z discussed above, the left cosets are 





{9o'*'9o,9o 




= {90: 


^91} 


9i{9o,gi} = 


{91-^90,91 




= {91,92} 


92{9o,gi} = 


{92*9o,g2 


*9i} 


= {92: 


• 90} 


93{9o,9i} = 


{93-*'9o,93 




= {93: 


■ 94} 


9i{9o,9i} = 






= {^4. 


• 95} 


95{9o,9i} = 


{95-^90,95 


-^9i} 


= {^5: 


,93} 



So, the left cosets of {go, gi} in S3 are {go, gi}, {5-1, 5-2}, {52, go}, {93, 9^, {fl'4, 95}, 
and {^5,^3}- 

We can now understand the following definition. H is a Normal Subgroup of G if V/i G H, 
g^^ -kh-k g e H. Or, in other words, if H denotes the subgroup, it is a normal subgroup if 
gH — Hg, which says that the left and right cosets are all equal. 

As a comment, saying gH and Hg are equal doesn't mean that each individual element 
in the coset gH is equal to the corresponding element in Hg, but rather that the two 
cosets contain the same elements, regardless of their order. For example, if we had the 
cosets {gi, gj, gk} and {gj, gh, gi}, they would be equal because they contain the same three 
elements. 
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This definition means that if you take a subgroup H of a group G, and you multiply the 
entire set on the left by some element of g E G, the resulting set will contain the exact same 
elements it would if you had multiplied on the right by the same element g E G. Here is 
an example to illustrate. 

Example 12 

Consider the order 2 subgroup {go,g3} C S^. Multiplying on the left by, say, 
g4, gives 

94 * {go, 93} = {g4 * go, g^ * gs} = {g^, g2} 
And multiplying on the right by 514 givs 

{fl'o,fl'3}*9'4 = {9o*9A,g3*g4} ^ {94, gi} 

So, because the final sets do not contain the same elements, {(74, g2} 7^ {g^, gi}, 
we conclude that the subgroup {go,g3}is not a normal subgroup of S3. 

Example 13 

Above, we found that {go, 91,92} C 5'3 is a subgroup of order 3 in 5*3 . To use a 
familiar label, remember that we previously called the group of order 3 (Z3, +). 
So, dropping the '+', we refer to the group of order 3 as Z3. Is this subgroup 
normal? We leave it to you to show that it is. 

Example 14 

Consider the group of integers under addition, (Z, +). And, consider the sub- 
group Ze^en C Z, the even integers under addition (we leave it to you to show 
that this is indeed a group). 

Now, take some odd integer Uodd and act on the left: 

nodd + '^even = {^odd + 0, Uodd ± 2, Uodd ± 4, . . .} 

and then on the right: 

nodd = {0 + Uodd, ±2 + Uodd, ±4 + Uodd, ■ ■ ■} 

Notice that the final sets are the same (because addition is commutative). So, 
1'even C Z is a normal subgroup. 

With a little thought, you can convince yourself that all subgroups of Abelian groups are 
normal. 

If G is a group and H d G is normal, then the Factor Group of H in G, denoted G/H (read "G 
mod H"), is the group with elements in the set G/H = {gH\g e G}. The group operation ★ is 
understood to he 

{9iH)-k{gjH) = {gi-kgj)H 
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Example 15 

Consider again l^even- Notice that we can call l^even — 2Z because 2Z = 
2{0, ±1, ±2±3, . . .} = {0, ±2, ±4, . . .} = Ze^en- We know that 2Z c Z is normal, 
so we can build the factor group Z/2Z as 

Z/2Z = {0 + 2Z, ±1 + 2Z, ±2 + 2Z, . . .} 

But, notice that 

^odd + 2Z = Zodd 

So, the group Z/2Z only has 2 elements; the set of all even integers, and the set 
of all odd integers. And we know from before that there is only one group of 
order 2, which we denote Z2. So, we have found that Z/2Z = Z2. 

You can also convince yourself of the more general result 

Z/nZ = Z„ 

Example 16 

Finally, we consider the factor groups G/G and G/e. 

• G/G — The set G = {^fo, fi'i, 5'2, • • •} will be the same coset for any element 
of G multiplied by it. Therefore this factor group consists of only one 
element, and therefore G/G = e, the trivial group. 

• G/e — The set {e} will be a unique coset for any element of G, and there- 
fore G/e = G. 

Something that might help you understand factor groups better is this: the factor group 
G/H is the group that is left over when everything in is "collapsed" to the identity 
element. Think about the above examples in terms of this picture. 

IfG and H are both groups (not necessarily related in any way), then we can form the Product 
Group, denoted K = G ® H, where an arbitrary element of K is {gi, hfj. If the group operation 
ofG is -kor dnd the group operation of H is -kn, then two elements of K are multiplied according 
to the rule 

{9i, hj) -kK {gk, hi) = {gi -kc 9k, hj -kn hi) 
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2.1.7 Reducibility Revisited 



Now that we understand subgroups, cosets, normal subgroups, and factor groups, we 
can begin a more formal discussion of reducing representations. Recall that in deriving 
equation ( ZT] >, we made the designation 

9o |ei) gi |e2) 92 \h) etc. 

This was used to create an order(G')-dimensional Euclidian space which, while not having 
any "physical" meaning, and while obviously not possessing any structure similar to the 
group, was and will continue to be of great use to us. 

We have an n-dimensional space spanned by the orthonormal vectors | (70) , , . . . , \gn~i), 
where qq is understood to always refer to the identity element. This brings us to the first 
definition of this section. For a group G = {go, gi, 92, ■ ■ call the Algebra ofG the set 

n-1 



C[G] = |^Ci|gi) CiGCVij 

i=0 ^ 



In other words, C[G] is the set of all possible linear combinations of the vectors \gi) with 
complex coefficients. 

We could have defined the algebra over Z or M, but we used C for generality at this point. 
Addition of two elements of C[G] is merely normal addition of linear combinations, 

n— 1 n—1 n—1 

^Ci\gi) + ^di\gi) = ^(q + di)\g^) 

i=0 i=0 i=0 



This definition amounts to saying that, in the n-dimensional Euclidian space we have cre- 
ated, with n = order(G), you can choose any point in the space with complex coefficients, 
and this will correspond to a particular linear combination of elements of G. 

Now that we have defined an algebra, we can talk about group actions. Recall that the 
gi's don't act on the \gj)'s, but rather the representation D{gi) does. We define the action 
D(gi) on an element of C[G] as follows: 

n-1 

^(9i) - ^Cjlgj) = D{g,i) ■ (colfi-o) + Cil^-i) H h c„_i|5f„_i)) 

j=0 

n-1 

= Co\gi-kgo) + ci\gi'kgi) -\ h Cn-i\gi -k g^-i) = '^Cj\gi-kgj) 

j=0 



Previously, we discussed how elements of a group act on each other, and we also talked 
about how elements of a group act on some other object or set of objects (like three painted 
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eggs). We now generalize this notion to a set of q abstract objects a group can act on, 
denoted M = {mo, rrii, 1712, ... , mq_i}. Just as before, we build a vector space, similar to 
the one above used in building an algebra. The orthonormal vectors here will be 

mo^\mo), mi |mi), ... mg_i -> |mg_i) 

This allows us to understand the following definition. 

The set 



CM 



^ i=0 



Ci G C Vi 



is called the Module of M. (We don't use the square brackets here to distinguish modules 
from algebras). In other words, the space spanned by the \mi) is the module. 

Example 17 

Consider, once again, 53. However, we generalize from three eggs to three 
"objects" mo, mi, and m2. So, CM is all points in the 3-dimensional space of 
the form co|mo) + ci|mi) + C2|m2) with q G C Wi. 

Then, operating on a given point with, say, gi gives 

gi{co\mo) + ci|mi) + C2|m2)) = (cd^iimo) + cil^fimi) + C2|5'im2)) 



and from the multiplication table on page 19 we know 

gimo = mi, giuii = mo, gim2 = m2 

So, 

(cols-iw^o) + Cilfi-imi) + C2|fi'im2)) = (co|mi) + Ci|mo) + C2|m2)) 

= Ci|mo) + Co|mi) + C2|m2) 

So, the effect of gi was to swap ci and cq. This can be visualized geometrically 
as a reflection in the cq = ci plane in the 3-dimensional module space. We can 
visualize every element of G in this way. They each move points around the 
module space in a well-defined way. 

This allows us to give the following definition. If CV is a module, and CW is 
a subspace of CV that is closed under the action of G, then CW is an Invariant 
Subspace o/CV. 

Example 18 

Working with ^3, we know that 5*3 acts on a 3-dimensional space spanned by 

|mo) = (1,0,0)^, |mi) = (0,1,0)^, and |m2) = (0, 0, 1)^ 
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Now, consider the subspace spanned by 

c(|mo) + |mi) + Ims)) (2.3) 

where c G C, and c ranges over all possible complex numbers. If we restrict 
c to M, we can visualize this more easily as the set of all points in the line 
through the origin defined hy X{i + j + k) (where A G M). You can write out the 
action of any element of 5*3 on any point in this subspace, and you will see that 
they are unaffected. This means that the space spanned by ( |2.3[ > is an invariant 
subspace. 

As a note, all modules CV have two trivial invariant subspaces. 

• CV is a trivial invariant subspace of CV 

• Ce is a trivial invariant subspace of CV 

Finally, we can give a more formal definition of reducibility. If a representation D of a group 
G acts on the space of a module CM, then the representation D is said to he Reducible z/CM 
contains a non-trivial invariant subspace. If a representation is not reducible, it is Irreducible. 

We encouraged you to write out the entire regular representation of 6*3 above. If you have 
done so, you may have noticed that every 6x6 matrix appeared with non-zero elements 
only in the upper left 3x3 elements, and the lower right 3x3 elements. The upper 
right and lower left are all 0. This means that, for every element of 6*3, there will never 
be any mixing of the first 3 dimensions with the last 3. So, there are two 3-dimensional 
invariant subspaces in the module for this particular representation of 5*3 (the regular 
representation). 

We can now begin to take advantage of the fact that representations live in linear spaces 
with the following definition. 

If V is any n-dimensional space spanned by n linearly independent basis vectors, and U and W 
are both subspaces ofV, then we say that V is the Direct Sum ofU and W if every vector v eV 
can be written as the sum v = u + w, where u e U and w e W, and every operator X acting on 
elements ofV can be separated into parts acting individually on U and W. The notation for this 
isV = U®W. 

In order to make this clearer, if is annx n matrix, it is the direct sum ofm x m matrix 
and k X k matrix Bk, denoted X„ = © B^, iff X is in Block Diagonal /orm, 

^""lo B,) 

where n = m + k, and Am, Bk, and the O's are understood as matrices of appropriate dimension. 
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We can generalize the previous definition as follows. 



a 



where n = ni + n2 + ■ ■ ■ + n^. 
Example 19 

1 1 -2\ 
Let ^3= I -1 5 TT L and let = 
-17 4 11/ 

/I 2 

3 4 

52 0^3= 


\0 



B. 



1 2 
3 4 




1 



712 



V 



Then, 



\ 



1 -2 



-1 5 TT 

-17 4 11/ 



\ 




To take stock of what we have done so far, we have talked about algebras, which are the 
vector spaces spanned by the elements of a group, and about modules, which are the vec- 
tor spaces that representations of groups act on. We have also defined invariant subspaces 
as follows: Given some space and some group that acts on that space, moving the points 
around in a well-defined way, an invariant subspace is a subspace which always contains 
the same points. The group doesn't remove any points from that subspace, and it doesn't 
add any points to it. It merely moves the points around inside that subspace. Then, we 
defined a representation as reducible if there are any non-trivial invariant subspaces in 
the space that the group acts on. 

And what this amounts to is the following: a representation of any group is reducible if it 
can be written in block diagonal form. 

But this leaves the question of what we mean when we say "can be written". How can 
you "rewrite" a representation? This leads us to the following definition. Given a matrix 
D and a non-singular matrix S, the linear transformation 

D^D' ^ S-^DS 

is called a Similarity Transformation. 

Then, we can give the following definition. Two matrices related by a similarity transforma- 
tion are said to be Equivalent. 

Because similarity transformations are linear transformations, if D{G) is a representation 
of G, then so is S~^DS for literally any non-singular matrix S. To see this, if gi ★ gj — g^, 
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then D{gi)D{gj) = D{gk), and therefore 

S-^D{g,)S ■ S-'D{g,)S = S-'D{g,)D{g^)S = S-'D{g,)S 

So, if we have a representation that isn't in block diagonal form, how can we figure out 
if it is reducible? We must look for a matrix S that will transform it into block diagonal 
form. 

You likely realize immediately that this is not a particularly easy thing to do by inspection. 
It turns out that there is a very straightforward and systematic way of taking a given rep- 
resentation and determining whether or not it is reducible, and if so, what the irreducible 
representations are. 

However, the details of how this can be done, while very interesting, are not necessary 
for the agenda of these notes. Therefore, for the sake of brevity, we will not pursue them. 
What is important is that you understand not only the details of general group theory and 
representation theory (which we outlined above), but also the concept of what it means 
for a group to be reducible or irreducible. 



2.2 Introduction to Lie Groups 



In section 2.1 we considered groups which are of finite order and discrete, which allowed 



us to write out a multiplication table. 

Here, however, we examine a different type of group. Consider the unit circle, where 
each point on the circle is specified by an angle 9, measured from the positive a;-axis. 




We will refer to the point at = as the "starting point" (like ROY was for the Easter 
eggs). Now, just as we considered all possible orientations of (ROY) that left the eggs 
lined up, we consider all possible rotations the wheel can undergo. With the eggs there 
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were only 6 possibilities. Now however, for the wheel there are an infinite number of 
possibilities for 9 (any real number G [0, 2n)). 

And note that if we denote the set of all angles as G, then all the rotations obey closure 
(9i + 02 = 03 EG, y0i,02 e G), associativity (as usual), identity {Q + = + Q = 0), and 
inverse (the inverse of is —0). 

So, we have a group that is parameterized by a continuous variable 0. So, we are no longer 
talking about gi's, but about g{0). 

Notice that this particular group (the circle) is Abelian, which is why we can (temporarily) 
use addition to represent it. Also, note that we obviously cannot make a multiplication 
table because the order of this group is oo. 

One simple representation is the one we used above: taking and using addition. A 
more familiar (and useful) representation is the Euler matrix g{0) = ^ *^gjj^^ cos^) with 
the usual matrix multiplication: 



cos 01 sin di\ ( cos 02 sin 02 \ 

— sin 01 cos 01 J sin 02 cos 02 J 

cos 01 COS 02 — sin 0i sin 02 cos 0i sin 02 + sin 0i cos 02 \ [-^ 

- sin 01 cos 02 — cos 0i sin 02 — sin 0i sin 02 + cos 0i cos 02/ 



(2.6) 



/ cos(ei + e2) sin(^i + 02 
V- sin(ei + ^^2) cos(ei + ^^2)^ 

This will prove to be a much more useful representation than with addition. 

Groups that are parameterized by one or more continuous variables like this are called 
Lie Groups. Of course, the true definition of a Lie group is much more rigorous (and com- 
plicated), and that definition should eventually be understood. However, the definition 
we have given will suffice for the purposes of these notes. 



2.2.1 Classification of Lie Groups 

The usefulness of group theory is that groups represent a mathematical way to make 
changes to a system while leaving something about the system unchanged. For example, 
we moved (ROY) around, but the structure "3 eggs with different colors lined up" was 
preserved. With the circle, we rotated it, but it still maintained its basic structure as a 
circle. It is in this sense that group theory is a study of Symmetry. No matter which of 
"these" transformations you do to the system, "this" stays the same — this is symmetry. 



To see the usefulness of this in physics, recall Noether's Theorem (section 1.1.21. When 



you do a symmetry transformation to a Lagrangian, you get a conserved quantity. Think 
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back to the Lagrangian for the projectile ( |1.2 |. The transformation x — > x + e was a sym- 
metry because e could take any value, and the Lagrangian was unchanged (note that e 
forms the Abelian group (M, +)). 

So, given a Lagrangian, which represents the structure of a physical system, a symme- 
try represents a way of changing the Lagrangian while preserving that structure. The 
particular preserved part of the system is the conserved quantity j we discussed in sec- 



tions [LL2]and 1.1.6 And as you have no doubt noticed, nearly all physical processes are 
governed by Conservation Laws: conservation of momentum, energy, charge, spin, etc. 

So, group theory, and in particular Lie group theory, gives us an extremely powerful way 
of understanding and classifying symmetries, and therefore conserved charges. And be- 
cause it allows us to understand conserved charges, group theory can be used to under- 
stand the entirety of the physics in our universe. 

We now begin to classify the major types of Lie groups we will be working with in these 
notes. To start, we consider the most general possible Lie group in an arbitrary number of 
dimensions, n. This will be the group that, for any point p in the n-dimensional space, can 
continuously take it anywhere else in the space. All that is preserved is that the points in 
the space stay in the space. This means that we can have literally any nxn matrix, or linear 
transformation, so long as the matrix is invertible (non-singular). Thus, in n dimensions 
the largest and most general Lie group is the group of all n x n non-singular matrices. We 
call this group GL{n), or the General Linear group. The most general field of numbers 
to take the elements of GL{n) from is C, so we begin with GL{n, C). This is the group of 
all n X n non-singular matrices with complex elements. The preserved quantity is that all 
points in C" stay in C". 



The most obvious subgroup of GL{n,C) is GL{n,R 
matrices with real elements. This leaves all points in \ 



or the set of all n x n invertible 
^inM". 



To find a further subgroup, recall from linear algebra and vector calculus that in n di- 
mensions, you can take n vectors at the origin such that for a parallelepiped, we could 
obtain 
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Then, if you arrange the components of the n vectors into the rows (or columns) of a 
matrix, the determinant of that matrix will be the volume of the parallelepiped. 

So, consider now the set of all General Linear transformations that transform all vectors 
from the origin (or in other words, points in the space) in such a way that the volume of 
the corresponding parallelepiped is preserved. This will demand that we only consider 
General Linear matrices with determinant 1. Also, the set of all General Linear matrices 
with unit determinant will form a group because of the general rule det | A .B| = det j^lj • 
det \B\. So, if det \A\ — 1 and det \B\ — 1, then det |A • S| = 1. We call this subgroup of 
GL{n, C) the Special Linear group, or SL{n, C). The natural subgroup of this is SL{n, M). 
This group preserves not only the points in the space (as GL did), but also the volume, as 
described above. 

Now, consider the familiar transformations on vectors in n-dimensional space of gen- 
eralized Euler angles. These are transformations that rotate all points around the ori- 
gin. These rotation transformations leave the radius squared (r^) invariant. And, be- 
cause = • f, if we transform with a rotation matrix R, then f ^ f' — Rf, and 
_^ ^/T _ fT^T ^ gQ ^/T . ^/ _ fT^T . g^^.^ said, we are demanding that the 

radius squared be invariant under the action of R, and so we demand f^i?^ ■ Rf = f'^ ■ f. 
So, the constraint we are imposing is R^ -R = I, which implies R^ = R~^. This tells us that 
the rows and columns of R are orthogonal. Therefore, we call the group of generalized 
rotations, or generalized Euler angles in n dimensions, 0{n), or the Orthogonal group. 
We don't specify C or R here because it will be understood that we are always talking 
about R. 

Also, note that because det \R^ ■ R\ = det |I| ^ (det \R\f = 1 ^ det \R\ = ±1. We again 
denote the subgroup with det|i?| = +1 the Special Orthogonal group, or SO{n). To 
understand what this means, consider an orthogonal matrix with determinant —1, such 
as 



This matrix is orthogonal, and therefore is an element of the group 0(3), but the determi- 
nant is —1. This matrix will take the point (x, y, z)^ to the point {x, y, —zY. This changes 
the handedness of the system (the right hand rule will no longer work). So, if we limit 
ourselves to SO{n), we are preserving the space, the radius, the volume, and the handed- 
ness of the space. 

For vectors in C space, we do not define orthogonal matrices (although we could). In- 
stead, we discuss the complex version of the radius, where instead of = • f, we have 
^2 _ . ^here the dagger denotes the Hermitian conjugate, = (r*)^, where * denotes 
complex conjugate. 

So, with the elements in R being in C, we have f Rf, and f^ f^R'^. So, f^-f ^ f^R^-Rf, 
and by the same argument as above with the orthogonal matrices, this demands that 
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R^^ ■ R = I, or R) = R^^. We denote such matrices Unitary, and the set of all such n x n 
invertible matrices form the group U{n). Again, we understand the unitary groups to 
have elements in C, so we don't specify that. And, we will still have a subset of unitary 
matrices R with det = 1 called SU (n), the Special Unitary groups. 

We can summarize the hierarchy we have just described in the following diagram: 



GLCn.C) 




GLCn.R) UCn) SLCn.C) 




SOCn) 



We will now describe one more category of Lie groups before moving on. We saw above 
that the group SO{n) preserves the radius squared in real space. In coordinates, this 

means that f'^ = xl + x^-i h x^, or more generally the dot product x-y = xiyi + X2y2 + 

• • • + Xnyn is preserved. 

However, we can generalize this to form a group action that preserves not the radius 
squared, but the value (switching to indicial notation for the dot product) x'^ya = —Xiyi — 
X2y2 x^ym+Xm+iym+1^ hx^+nym+n- We Call the group that preserves this quantity 
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SO{m, n). The space we are working in is still but we are making transformations 

that preserve something different than the radius. 

Note that SO{m,n) will have an SO{m) subgroup and an SO{n) subgroup, consisting of 
rotations in the first m and last n components separately. 

Finally, notice that the specific group of this type, 50(1, 3), is the group that preserves the 
values^ = — a^i?/i+a^2?/2+2^3?/3+3:4?/4, or written more suggestively, = —cH'^-\-x'^-\-y'^-\-z^. 
Therefore, the group 5*0 (1, 3) is the Lorentz Group. Any action that is invariant under 
5*0(1, 3) is said to be a Lorentz Invariant theory (as all theories should be). We will find 
that thinking of Special Relativity in these terms, rather than in the terms of Part I, will be 
much more useful. 

It should be noted that there are many other types of Lie groups. We have limited our- 
selves to the ones we will be working with in these notes. 



2.2.2 Generators 



Now that we have a good "birds eye view" of Lie groups, we can begin to pick apart the 
details of how they work. 

As we said before, a Lie group is a group that is parameterized by a set of continuous 
parameters, which we call for i = 1, . . . , n, where n is the number of parameters the 
group depends on. The elements of the group will then be denoted g{ai). 

Because all groups include an identity element, we will choose to parameterize them in 
such a way that g{oii)\^.^Q — e, the identity element. So, if we are going to talk about 
representations, Dn{g{(y.i))\^^^ — 1, where I is the n x n identity matrix for whatever 
dimension (n) representation we want. 

Now, take ai to be very small with 5ai « 1. So, Dn{g{0 + Sai)) can be Taylor expanded: 

Dn{g{Sai)) = I + 5ai — \^^^^ + ■■■ 

The terms I are extremely important, and we give them their own expression: 



dai \ai=0 

Xi = -I 



(2.7) 



ai=0 

(we have included the — i in order to make Hermitian, which will be necessary later). 
So, the representation for infinitesimal 5ai is then 

Dn{5ai) — iSaiXi + • • • 
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(where we have switched our notation from Dn{g{a)) to Dn{a) for brevity). 
The Xi's are constant matrices which we will determine later. 

Now, let's say that we want to see what the representation will look like for a finite value 
of ttj rather than an infinitesimal value. A finite transformation will be the result of an 
infinite number of infinitesimal transformations. Or in other words, ctj = NSai as A?^ ^ oo. 
So, 5ai — j^, and an infinite number of infinitesimal transformations is 

lim + lim h+i^X^f 

If you expand this out for several values of N, you will see that it is exactly 

We call the X/s the Generators of the group, and there is one for each parameter required 
to specify a particular element of the group. For example, consider 5'0(3), the group 
of rotations in 3 dimensions. We know from vector calculus that an element of 5'0(3) 
requires 3 angles, usually denoted cf), and 'ip. Therefore, 5*0(3) will require 3 generators, 
which will be denoted Xq, X^, and X^. We will discuss how the generators can be found 
soon. 

In general, there will be several (in fact, infinite) different sets of X/s that define a given 
group (just as there are an infinite number of representations of any finite group). What 
we will find is that up to a similarity transformation, a particular set of generators defines 
a particular representation of a group. 

So, Dn{ai) = e*"'"^' for any group (the i index in the exponent is understood to be summed 
over all parameters and generators). The best way to think of the parameter space for the 
group is as a vector space, where the generators describe the behavior near the identity, 
but form a basis for the entire vector space. By analogy, think of the unit vectors i, j, 
and k in R^. They are defined at the origin, but they can be combined with real num- 
bers/parameters to specify any arbitrary point in M^. In the same way, the generators are 
the "unit vectors" of the parameter space (which in general is a much more complicated 
space than Euclidian space), and the parameters (like 9, (j), and ijj) specify where in the 
parameter space you are in terms of the generators. That point in the parameter space 
will then correspond to a particular element of the group. 

We call the number of generators of a group (or equivalently the number of parameters 
necessary to specify an element), the Dimension of the group. For example, the dimen- 
sion of SO{3) is 3. The dimension of SO{2) (rotations in the plane) however is only 1 (only 
9 is needed), so there will be only one generator. 
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2.2.3 Lie Algebras 



In section 2.1 we discussed algebras. An algebra is a space spanned by elements of the 
group with C coefficients parameterizing the Euclidian space we defined. Obviously we 
can't define an algebra in the same way for Lie groups, because the elements are continu- 
ous. But, as discussed in the last section, a particular element of a Lie group is defined by 
the values of the parameters in the parameter space spanned by the generators. We will 
see that the generators will form the algebras for Lie groups. 

Consider two elements of the same group with generators Xj, one with parameter values 
ai and the other with parameter values Pi. The product of the 2 elements will then be 
giOiX,gi/3jXj Because we are assuming this is a group, we know that the product must be 
an element of the group (due to closure), and therefore the product must be specified by 
some set of parameters 5k, so e*°'^'e*^^^^ = e*'^*'^*. Note that the product won't necessarily 
simply be e*"*"^'e*^J"^J = e^i^'^^i+i^j^j) because the generators are matrices and therefore 
don't in general commute. 

So, we want to figure out what 5i will be in terms of and We do this as follows. 

iSiXk = ln(e*'^'=^'=) = ln(e*"»^'e^^^O = + e*"»^'e^^^^ - 1) = ln(l + x) 

where we have defined x = g««i^ig/3j^j _ x. We will proceed by expanding only to second 
order in and Pj, though the result we will obtain will hold at arbitrary order. By Taylor 
expanding the exponential terms, 

,..-V.,.,>.. - 1 = (1 + ».A-, + l(.a.X.r + ■ ■ ■ )(1 + W,X, + l(.p,X,r + ...)-! 

= 1 + iP^Xj - -iPjXjf + iaiX, - aiXipjXj - -{aiX,)^ - 1 
= i{aiX, + PjX,) - aiXiPjX, - ^ ((a,X,)' + {PjX.y) 

_ ^ 2 3 4 

Then, using the general Taylor expansion ln(l + x) = ^ ~ \ + % — ^ + ''' / ^rid again 
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keeping terms only to second order in a and /3, we have 



X 



X — 



iip-iXi + PjXj) — ttiXiPjXj — 



{a,X,Y + iP^X,) 



- {aiX, + (3jXj){aiX, + f3jXj 

1 



i{aiXi + (3jXj) - aiXi(3jXj 



+ - 



{aiX.f + iPjX.f + aiPjiXiX, + X,X- 



So finally we can see 



or 



i{aiXi + (3jX^) + -ail3j{X^Xi - XiX^) 
i{aiXi + (3jXj) - ^a,Pj[X„Xj] 
i{aiXi + pjXj) - ^[aiXi,pjXj] 



i6kXk = i{aiXi + (3jXj) - - [atXi, Xj] 



(2.8) 



Equation (2.8 1 is called the Baker-Campbell-Hausdorff formula, and it is one of the most 
important relations in group theory and in physics. Notice that, if the generators com- 
mute, this reduces to the normal equation for multiplying exponentials. You can think of 
equation ( 2.8[ > as the generalization of the normal exponential multiplication rule. 



Now, it is clear that the commutator [Xj, Xj] must be proportional to some linear combi- 
nation of the generators of the group (because of closure). So, it must be the case that 



(2.9) 



for some set of constants fijk- These constants are called the Structure Constants of the 
group, and if they are completely known, the commutation relations between all the gen- 
erators are known, and so the entire group can be determined in any representation you 
want. 

The generators, under the specific commutation relations defined by the structure con- 
stants, form the Lie Algebra of the group, and it is this commutation structure which 
forms the structure of the Lie group. 
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2.2.4 The Adjoint Representation 



We will talk about several representations for each group we discuss, but we will men- 
tion a very important one now. We mentioned before that the structure constants fijk 
completely determine the entire structure of the group. 

We begin by using the Jacobi identity, 

[X„ [Xj, Xk]] + [X„ [Xk, X,]] + [Xk, [X„ X,]] = (2.10) 

(if you aren't familiar with this identity, try multiplying it out. You will find that it is 



identically true — all the terms cancel exactly). But, from equation (2.9), we can write 

[^i, i^j^^k]] = ifjka[^i,^a] = ifjkafiab^b 



Plugging this into (2.10) we get 



ifjkafiabXb + ifkiafjab^b + ifijafkab^b — 
^ (y fjkafiab ~l~ fkiafjab ~l~ fijafkab^'^-^b 

fjkafiab ~l~ fkiafjab ~l~ fijafkab (2.11) 

So, if we define the matrices 

[T'^]fe, = -tfabc (2.12) 

then it is easy to show that ( 2.11[ ) leads to 

[T'',T']=lfabcT'' 

So, the structure constants themselves form a representation of the group (as defined by 
( |2.12[ ). We call this representation the Adjoint Representation, and it will prove to be 
extremely important. 



Notice that the indices labeling the rows and columns in (2.12) each run over the same 
values as the indices labeling the T matrices. This tells us that the adjoint representation 
is made of n x n matrices, where n is the dimension of the group, or the number of 
parameters in the group. For example, S0{3) requires 3 parameters to specify an element 
{6, (f), ip), so the adjoint representation of S0{3) will consist of 3 x 3 matrices. S0{2) on the 
other hand is Abelian, and therefore all of the structure constants vanish. Therefore there 
is no adjoint representation of 5*0(2). 

We now go on to consider several specific groups in detail. 
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2.2.5 S0{2) 



We start by looking at an extremely simple group, S0{2). This is the group of rotations in 
the plane that leaves = x^+y^ = [x y)- = iP^ -v invariant. So for some generator X 

(which we will now find) of S0{2), v R{9)v = e^^^v, and v'^ v'^e''^^^ . So, expanding 
to first order only, yT^iex^^iex^ ^ ^t^^j^ _^ -q^t _^ iOx)v = ■ v + v^iO{X + X'^)v. And 
because we demand that be invariant, we demand that X + X'^ = ^ X = —X^. So, 
X must be antisymmetric. Therefore we take 

^^1/0 1 



I \-l 

(the \ is included to balance the i we inserted in equation (2.7 1 to ensure that X is Hermi- 
tian). 

So, an arbitrary element of SO {2) will be 

1 o\ , . i\ 1 .2 A o\ 1 ^3 / 1 



cos 6 sin 6* 
— sin 6 cos ^ 

which is exactly what we would expect for a matrix describing rotations in the plane. 

Also, notice that because 50(2) is Abelian, the commutation relations trivially vanish 
([X, X] = 0), and so all of the structure constants are zero. 

Now that we have found an explicit example of a generator, and seen an example of how 
generators relate to group elements, we move on to slightly more complicated examples. 



2.2.6 50(3) 

We could easily generalize the argument from the proceeding section and find the genera- 
tors of 50(3) in the same way, but in order to illustrate more clearly how generators work, 
we will approach 50(3) differently by working backwards. Above, we found the gener- 
ators and used them to calculate the group elements. Here, we begin with the known 
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group elements of 5*0(3), which are just the standard Euler matrices for rotations in 3- 
dimensional space: 




sin cos 6 
1, 



(2.13) 
(2.14) 
(2.15) 



Now, recall the definition of the generators, equation ( 2.7) . We can use it to find the 
generators of 5*0(3), which we will denote Jx, Jy, and Jz- 



J X 



And similarly 



IdRM 



4>=o 



'0 
— sin cos I 
, — cos 6 sin < 



J 1/ 








^0 







) 














0=0 ^ 


^0 


-1 





J. 




You can plug these into the exponentials with t he ap propr iate p arameters {(f), if), or 9) and 
find that e"^'^'' , e^'^'^y, and e*"-^^ reproduce ( 2.13[ >, (2.14), and (2.15), respectively. 



Furthermore, you can multiply out the commutators to find 



\Jzi Jx\ ^Jy 



or 



which tells us that the structure constants for 50(3) are 

fijk ^ijk 



(2.16) 



where ej^^ is the totally antisymmetric tensor. The structure constants being non-zero is 
consistent with 5*0(3) being a non-Abelian group. 
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2.2.7 SU{2) 



We will approach SU (2) yet another way: by starting with the structure constants. It turns 
out they are the same as the structure constants for SO (3): 

fijk = ^ijk (2.17) 

To see why, recall that SU (2) are rotations in two complex dimensions. The most general 



form of such a matrix U G SU (2) is U = 




. The "Special" part of SU (2) demands 



that the determinant be equal to 1, or 

ad — be = 1 

and the "Unitary" part demands that U^^ = W. So, 




or in other words. 




where we demand |ap + |6p = 1. 

Both a and b are in C, and therefore have 2 real components each, so U has 4 real param- 
eters. The constraint |ap + |6p = 1 fixes one of them, leaving 3 real parameters, just like 
in S0{3). This is a loose explanation of why SU{2) and S0{3) have the same structure 
constants. They are both rotational groups with 3 real parameters. 

This also tells us that SU{2) will have 3 generators. 



2.2.8 SU{2) and Physical States 



T 



The elements of any Lie group (in a (i-dimensional representation consisting of d x d ma- 
trices) will act on vectors, just like the 3x3 matrices representing S^ acted on (i? O Y) 
in section 2.1.5 The most natural way to understand the space a Lie group acts on is to 
study the eigenvectors and eigenvalues of the generators of the representation you are 
using (the reason for this is beyond the scope of these notes at this point, but will be- 
come more clear as we proceed). These eigenvectors will obviously form a basis of the 
eigenspace of the physical space the group is acting on. 

Using similarity transformations, one or more of the generators of a Lie group can be di- 
agonalized. For now, trust us that with SU{2), it is only possible to diagonalize one of the 
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three generators at a time (you may convince yourself of this by studying the commuta- 
tion relations). We will call the generators of SU (2) J\ .P, and J^, and by convention we 
take to be the diagonal one. So, consequently, the eigenvectors of will be the basis 
vectors of the physical vector space upon which SU (2) acts. 

Now, we know that (whatever it is ... we don't know at this point) will in general 
have more than one eigenvalue. Let's call the greatest eigenvalue of (whatever it is) 
j, and the eigenvectors of will be denoted |j; m) (the first j is merely a label — the 
second value describes the vector), where m is the eigenvalue of the eigenvector. The 
eigenvector corresponding to the greatest eigenvalue j will obviously then be So, 
J^\j;j) = or more generally J'^\j; m) = m\j; m). 

Now let's assume that we know |j; j). There is a trick we can employ to find the rest of 
the states. Define the following linear combinations of the generators: 

J^^l^{J'±^.f) (2.18) 
Now, using the fact that the SU (2) generators obey the commutation relations in equation 



(2.16 1, it is easy to show the following relations, 

[J^ J^] = ±J^ and [J+, J-] = (2.19) 
Notice that, because by definition are all Hermitian, we have 

(J-)t = j+ (2.20) 



Consider some arbitrary eigenvector \j; m). We know the eigenvalue of this will be m, so 
J^Ij; m) = m\j;m). But now let's create some new state by acting on \j; m) with either of 



the operators (2.181. The new state will be J^|j; m), but what will the eigenvalue be? 



Using the commutation relations in (2.19), 



r.r\]]m) = (±J± + J^r)\]]m) = (m± l)J±|j;m) 

So, the vector J+|j; m) is the eigenvector with eigenvalue m + 1, and the vector J~ |j; m) 
is the eigenvector with eigenvalue m — 1. 

If we have some arbitrary eigenvector \j; m), we can use to move up or down to the 
eigenvector with the next highest or lowest eigenvalue. For this reason, J"^ are called the 
Raising and Lowering operators. They raise and lower the eigenvalue of the state by one. 

Clearly, the eigenvector with the greatest eigenvalue j, with eigenvector cannot 
be raised any higher, so we define J^\j]j) = 0. We will see that there is also a lowest 
eigenvalue j', so we similarly define = 0. 

Now, considering once again \ We know that if we operate on this state with J^, we 
will get the eigenvector with the eigenvalue j — 1. But, we don't know exactly what this 
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state will be (knowing the eigenvalue doesn't mean we know the actual state). But, we 
know it will be proportional to \ — 1) . So, we set J" = Nj — 1), where Nj is the 



proportionality constant. To find Nj, we take the inner product (and using (2.20 1): 

{j;j\J^J-\j;j) = \N,\'{j;j-l\j;j-l) 

But we can also write 

{j;j\J^J-\j;j) = {j;j\iJ^J--J-J^)\j;j) = {j;j\[J^,J-]\j;j) 

= (j;jV'Ij;j)=j(j;jIj;j)=j (2.21) 



where we used the fact that J+ =0 to get the first equality, and ( 2.19| to get the third 
equality. We also assumed that \ is normalized. 



So, dZllI ) tells us 

(j; J - J - 1) = 1 ^ iV, ^ V7 (2.22) 
And our normalized state is therefore 

Repeating this to find iVj_i, we have 

|iV,_i|2(j;j-2|j;j-2) = {j;j-l\J+J-\j;j-l) 

VJ VJ 
= -{j;j\J^J^J-J-\j;j) 

= j{j;j\JHJ' + J-J^)J-\j;j) 

= -{j;j\{J^J'J- + J-'J-J'-J-)\j;j) 
J 

= j(j;j|(-[AJ1 + 2j[J+,J-])|j;j) 

= -{j;j\{-J' + 2jJ')\j;j) 
J 

= i(2/-j)=2j-l (2.23) 
J 



So, |iV,_i|2 = 2j - 1, or iV,_i = v/2j^. 

47 



We can continue this process, and we will find that the general result is 

iV,_fc = i= ^(2j-k){k + l) (2.24) 
and the general states are defined by 

\r,j-k) = ^{J-r\r,j) 



Notice that these expressions recover (2.221 and (2.23) for k = and k = 1, respectively. 
Furthermore, notice that when k = 2j, 

iV,_2, = -^v/(2j-2j)(2j + l) = 

So, the state — k) I = | j; —j) is the state with the lowest eigenvalue, and by defini- 
tion J-\]; -j) = 0. 

So, in a general representation of SU (2), we have 2j + 1 states: 

- 1, J - 2, . . . , -J + 2, -J + 1, -j} 

This therefore demands that j = f for some integer n. In other words, the highest eigen- 
value of an SU (2) eigenvector can be 0, |, 1, |, 2, etc. 

Furthermore, using these states, it is easy to show 

{j;m'\J^\j]m) = m5m',m 



{j;m'\J'^\j;m) = -^a/O' + m + l)(j - m)(5„/,^+i 



(2.25) 



2.2.9 SU{2)iorj = l 

We will skip the j = case because it is trivial (though we will discuss it later when we 
return to physics). 

For j = \, the two eigenvalues of will be \ and \ — ^ = —\- So, denoting the generator 
of SU (2) when j = | as Jfj^, we have 



1^ 



1/2 
1/2 
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Now, inverting (2.18) to get 



and J2 = — (J- - J^ 
v2 



and using the standard matrix equation [J"]m',m = {j,''^'\J"'\j,'m')> and the explicit prod- 
ucts in ( 2.25[ >, we can find (for example) 



1 1 

2' ~2 



J' 



1 1 

2'^2 



So [J^]ii = 0. Then, 



1 1 
2'~2 



J' 



1 1 
2' 2 



1 1 
2' ^2 



1 1 
2' ~2 



V2 



(J- + 



{J- + r 



1 1 

2' ~2 



1 1 
2' 2 



So [ji; 
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We can continue this to find all the elements for each generator for j = 1/2. The final 
result will be 



1 /O 1 



2 U 



a 



1 /O -I 



/2 - 2 U 



^ ,3 _iA 

2 ' 1/2-2 lo -1 



(2.26) 



where the a* matrices are the Pauli Spin Matrices. This is no accident! We will discuss this 
in much, much more detail later, but for now recall that we said that SU (2) is the group of 
transformations in 2-dimensional complex space (with one of the real parameters fixed, 
leaving 3 real parameters). We are going to see that SU (2) is the group which represents 
quantum mechanical spin, where j is the value of the spin of the particle. In other words, 
particles with spin 1/2 are described by the j = 1/2 representation (the 2x2 representation 



in (2.26)), and particles with spin 1 are described by the j = I representation, and so on. 
In other words, SU (2) describes quantum mechanical spin in 3 dimensions in the same 
way that 5*0(3) describes normal "spin" in 3 dimensions. We will talk about the physical 
implications, reasons, and meaning of this later. 

However, as a warning, be careful at this point not to think too much in terms of physics. 
You have likely covered SU (2) in great detail in a quantum mechanics course (though 
you may not have known it was called "SU (2)"), but the approach we are taking here 
has a different goal than what you have likely seen before. The properties of SU (2) we 
are seeing here are actually very, very specific and simplified illustrations of much deeper 
concepts in Lie groups, and in order to understand particle physics we must understand 
Lie groups in this way. So for now, try to fight the temptation to merely understand 
everything we are doing in terms of the physics you have seen before and learn this as 
we are presenting it: pure mathematics. We will focus on how it applies to physics later, 
in its fuller and more fundamental way than introductory quantum mechanics makes 
apparent. 
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2.2.10 SU{2) for j = 1 



You can follow the same procedure we used above to find 

/O 1 0\ /O -i \ /I \ 

= -= 1 1 , Jl^^ii , = (2.27) 

\0 1 0/ V2 • Q y \q Q 

Notice that only Jf is diagonal (as before), and that the eigenvalues are {1,0, —1}, or 
{h i - 1, i - 2 = -j} as we'd expect. 



2.2.11 SU{2) for Arbitrary j 



For any given j, we have 3 generators J^, J|, and J|, and for whatever dimension {d 
2j + 1) the physical space we are working in, we have d eigenvectors 








1 





|j;j-2) = 





1 



\V w \v 

with eigenvalues {j, j — 1, j — 2, ■ ■ ■ , — j}, respectively. 



/0\ 






Then, for any j, we can form the linear combinations = ^{Jj ±iJj). For example, for 
j — 1/2 these are 



7+ - _ 



1/0 l\ ^:(0 
2 U Oi ^ 2 U 



2v^V0 0. 



2 



and similarly 



J 



1/2 



_ ^ /O 



^ 1 

V2V0 



So, the two j = 1/2 eigenvectors will be ||; |) = and 



1 . _ i\ 

2' 2/ 



■^1 



1/2 
1 



/2 



1_ 1 

2' 2 

1 

2'~2 



J_ fO 1 
V2\0 

J_foo 
V2V 








. So, 
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and similarly 



1/2 



J 



1/2 



1_ 1 

2' 2 
1_ 1 
2'~2 



1_ 1 
2'~2 
1_ 1 
2' 2 



which is exactly what we would expect. 



The same calculation can be done for the j = 1 case and we will find the same results, 
except that the j = 1 state (the first eigenvector) can be lowered twice. The first time 
Jj^2 ^^ts it takes it to the state with eigenvalue 0, and the second time it acts it takes it 
to the state with eigenvalue —1. Acting a third time will destroy the state (take it to 0). 
Analogously, the lowest state, with eigenvalue j — —I can be raised twice. 

We can do the same analysis for any j = integer or half integer. 

As we said before, we interpret j as the quantum mechanical spin of a particle, and the 
group SU (2) describes that rotation. It is important to recognize that quantum spin is 
not a rotation through spacetime (it would be described by 5*0(3) if it was), but rather 
through the mathematically constructed spinor space. We will talk more about this space 
later. 



So for a given particle with spin, we can talk about both its rotation through physical 
spacetime using 5*0(3), as well as its rotation through complex spinor space using SU (2). 
Both values will be physically measurable and will be conserved quantities. The total 
angular momentum of the particle will be the combination of both spin and spacetime 
angular momentum. Again, we will talk much more about the spin of physical particles 
when we return to a discussion of physics. We only mention this now to give a preview 
of where this is going. However, spin is not the only thing SU (2) describes. We will also 
find that it is the group which governs the weak nuclear force (whereas U{1) describes 
the electromagnetic force, and SU{3) describes the strong force ... much, much more on 
this later). 



2.2.12 Root Space 



As a comment before beginning this section, it is likely that you will find this to be the 
most difficult section of these notes. The material here is both extremely difficult (espe- 
cially the first time it is encountered), and extremely important to the development of 
particle physics. In fact, this section is the most central to what will come later in these 
notes. If the contents are not clear you are encouraged to read this section multiple times 
until it becomes clear. It may also be helpful to study this section while looking closely 
at the examples in the sections forming the remainder of this part of these notes. They 
illustrate the point of where we are going with all of this. 
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We saw in the previous section that we can view the physical space that a group is acting 
on by using the eigenvectors of the diagonal generators as a basis. These eigenvectors 
can be arranged in order of decreasing eigenvalue. Then, the non-diagonal generators 
can be used to form linear combinations that act as raising and lowering operators, which 
transform one eigenvector to another, changing the eigenvalue by an amount defined by 
the commutation relations of the generators. 

We now see that this generalizes very nicely. 

An arbitrary Lie group is defined in terms of its generators. As we said at the end of 
section | 2.2.2[ it is best to think of the generators as being analogous to the basis vectors 



spanning some space. Of course, the space the generators span is much more complicated 
than M" in general, but the generators span the space the same way. In this sense, the 
generators form a linear vector space. So, we must define an inner product for them. For 
reasons that are beyond the scope of these notes, we will choose the generators and inner 
product so that, for generators T"- and T^, 

(T'^, T^) = - Tr (T'^T^) = 8"^ (2.28) 

K 

where k is some normalization constant. 

Also, in the set of generators of a Lie group, there will be a closed subalgebra of generators 
which all commute with each other, but not with generators outside of this subalgebra. 
In other words, this is the set of generators which can be simultaneously diagonalized 
through some similarity transformation. For SU (2), we saw that there was only one gen- 
erator in this subalgebra which we chose to be J| (recall that a matrix will only commute 
with all other matrices if it is equal to the identity matrix times a constant, whereas two 
diagonal matrices will always commute regardless of what their diagonal elements are). 

Let's say that a particular Lie group has generators total, or is an A^-dimensional group. 
Then, let's say that there are M < N generators in the mutually commuting subalgebra. 
We call those M generators the Cartan Subalgebra, and the generators in it are called 
Cartan Generators. We define the number M as the Rank of the group. 

By convention we will label the Cartan generators if * (z = 1, . . . , M) and the non-Cartan 
generators {i = 1, . . . ,N — M). 



2 



For example, with SU{2) we had = J|, and = J], E"^ = J] 

Before moving on, we point out that this should seem familiar. If you think back to an 
introductory class in quantum mechanics, recall that we always choose some set of vari- 
ables that all commute with each other (usually we choose either position or momentum 
because [x,p] ^ 0). Then, we expand the physical states in terms of the position or mo- 
mentum eigenvectors. Here, we are doing the exact same thing, only in a much more 
general context. 

Now, the Ws are simultaneously diagonalized, so we will write the physical states in 
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terms of their eigenvalues. In an n-dimensional representation Dn, the generators are n x n 
matrices, so the eigenvectors are n-dimensional. So, there will be a total of n eigenvectors, 
and each will have one eigenvalue with each of the M Cartan generators So, for each 
of these eigenvectors, which we temporarily denote for j = 1, . . . ,n, we have the 
M eigenvalues with M Cartan generators, which we call (where j = 1, 



the eigenvectors, and i 
Weight Vector 



1, . . . , n labels 

M labels the eigenvalues), and we form what is called a 



^1 



(2.29) 



where j = 1, . . . ,n. The individual components of these vectors, the t*'s, are called the 
Weights. 

So for a given representation Dn, we now denote the state |-D„; tj) (instead of So, our 
eigenvalues will be 



(2.30) 



As we mentioned before, the adjoint representation is a particularly important represen- 
tation. If you do not remember the details of the adjoint representation, go reread section 



2.2.4 Here, the generators are defined by equation (2.12), [T^]bc = —ifabc- Recall that each 
index runs from 1 to A^, so that the generators in the adjoint representation are N x N 
matrices, and the eigenvectors are A^-dimensional. 

Also, as a point of nomenclature, weights in the adjoint representation are called Roots, 
and the corresponding vectors (as in ( 2.29[ >) are called Root Vectors. 



This means that there is exactly one eigenvector for each generator, and therefore one 



root vector for each generator. So, in equation (2.29), j = 1, . . . , N. We make this more 
obvious by explicitly assigning each eigenvector to a generator as follows. First, because 
we now have the same number of generators, eigenvectors, and root vectors, we label the 
generators by the root vectors T*^ instead of T^. Also, we now refer to general eigenstates 
as \Adj; T*^), where j = 1, . . . , and t, is the M-dimensional root vector corresponding 
to T*^. And, we also divide the states \Adj;T^') into two groups: those corresponding 
to the M Cartan generators \Adj; H^^) (where j = 1, . . . , M and hj is the M-dimensional 
root vector corresponding to H^^), and those corresponding to the N — M non-Cartan 
generators \Adj; E^^) (where j = 1, . . . N — M and ej is the M-dimensional root vector 
corresponding to E^^). 

Don't be alarmed by the superscripts being vectors. We are using this notation for later 
convenience, and T*' here means the same thing did before (the generator). This 
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notation, which we use only for the adjoint representation, is simply taking advantage 
of the fact that in the adjoint representation, the total number of generators, the number 
of eigenvectors of the Cartan generators, the dimension of the representation, and the 
number of weight/ root vectors is the same. 



Also, with the adjoint representation states \Adj; T*^ ), we can use equation (2.28 ) to define 
the inner product between states as 

{Adj ; T*^ I Adj ; T*'^ ) = - Tr (T*^ T*'= ) = 6''' (2.31) 
We will make use of this equation soon. 

The matrix elements of a given generator will then be given by the familiar equation 

- ^fabc = [T%c = {Adj;T'^\T'^\Adj;T'^) 

We want to know what an arbitrary generator T*" will do to an arbitrary state \Adj; T**") 
in the adjoint representation. So, 

T*« \Adj; T*" ) = ^ I Arfj ; T*^ ) {Adj ; T*^ | T*" \Adj; T*" ) = ^ | Adj ; T*^ ) [T*" ] 

c c 
c c 

And, because there is exactly one eigenvector for each generator, the state \Adj; T^") cor- 
responds to the generator T^. And because we know that 

ifabcT'^ = [T'%r^] 

(where c is understood to be summed) by definition of the structure constants, we can 
infer that 

T'^\Ad3-r^) = J2^Uc\Adj-r^) = \Adj- [T*^T*'']) (2.32) 

c 

where [T*" , T**"] is simply the commutator. 



The derivation of equation (2.321 is extremely important, and it is vital that you under- 
stand it. However, it is also one of the more difficult results of this already difficult sec- 
tion. You are therefore encouraged (again) to read through this section, comparing it with 
examples several times until it becomes clear. 

So, let's apply this to combinations of the two types of generators we have, iJ'^^'s and 
E^^'s. If we have a Cartan generator acting on a state corresponding to a Cartan generator, 
we have (from equation ( 2.30[ >) 

H^^\Adj]H^') = hl\Adj]H^'') 
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But from (2.321 we have 



H~^"\Adj;H~^') = \ Adj; [H^\H^']) 

By definition, the Cartan generators commute, so [i/*", i/*''] = 0, and therefore 

hb = Q (2.33) 

So we can drop them from our notation, leaving the eigentstates corresponding to non- 
Cartan generators denoted \Adj] H^). 

On the other hand, if we have a Cartan generator acting on an eigenstate corresponding 

(2.34) 



to a non-Cartan generator, equation (2.30 1 gives 

H''\Adj;E^>') = el\Adj; [H^^E^^]) 



And equation (2.321 gives 



H'^\Adj-E'^) = \Adj-[H\E'^]) 



(2.35) 



Now, we don't know a priori what [if^, E'^*'] is, but comparing ( [234| > and ( [235| ), we see 

\Ad3-elE-'^) = \Ad3-[H\E--^]) 

And because we know that each of these vectors corresponds directly to the generators, 
we have the final result 



(2.36) 



Now we want to know what a non-Cartan generator does to a given eigentstate. Consider 
an arbitrary state \Adj; T*'') with eigenvalue tg. We can act on this with E'^'^ to create 
the new state E^''\Adj; T^^). So what will the eigenvalue of this new state be? Using 
( |236l >, 

H^E^^lAdj; T*") = {H^E^^ - E^^H^ + E^^H^)\Adi] T^") = {[H", E^^] + E^^H'')\Adj] T*") 
= {elE'^ + E'H'^^\Adj-T'^) = {tl + el)E'^\Adj-T'^) 
= {h + earE'-\Adj-r^) (2.37) 

So, by acting on the one of the eigenstates with a non-Cartan generator we have 
shifted the H'^ eigenvalue by one of the coordinates of the root vector. What this means 
is that the non-Cartan generators play a role analogous to the raising and lowering op- 
erators we saw in SU (2), except instead of merely shifting the state "up" and "down", it 
moves the states around through some M-dimensional space. 

From this, we can also see that if there is an operator that can transform from one state 
to another, there must be a corresponding operator that will make the opposite trans- 
formation. Therefore, for every operator E'^'^, we expect to have the operator E^^", and 
corresponding eigenstate \Adj; E'^"). 
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Finally, consider the state E'^^IArfj; E"^"). We know from (232) that E^-^l Adj; E'^") 
\Adj; [E^'^ , E"^"]) . The eigenvalue of this state can be found using equation (2.37): 

H^E^'^\Adj; E-~^^) = {-ea + eafE^'^lAdj; E"""") = 



But according to equation (2.33), states with eigenvalue are states corresponding to Car- 
tan generators. Therefore we conclude that the state E^''\Adj] E~^'') is proportional to 
some linear combination of the Cartan states. 



E'-\Ad3-E-'^) = Y,Nb\Ad3:H') 



(2.38) 



where the N}y's are the constants of proportionality. To find the constants Ni„ we follow 
an approach similar to the one we used in deriving (2.24). Taking the inner product and 
using ( 2.32[ ), 

{Adj- H^\E~^^\Adj; E-~^^) = J2 Nb{Adj; H^\Adj; H^) = ^ NbS"^ = (2.39) 

b b 

{Adj; H'\Adj- [E~'% E-~'^]) = 



Then, using (2.31 ) 



{Ad3-H'\Ad3-[E-''^,E--'^]) 



-Tr{H^[E^%E-^^]) 

K 

-Tr(E-""°[if^E^"'']) 

K 

-e1Ti{E-^-E~^-) 

K 



So, 



N = e'^ 



And therefore equation (2.38) is now 

E''^\Adj-E~'-) = \Adj- [E'-,E-'-]) = el\Adr,H'') 
where the sum over b is understood. This leads to our final result, 

[E^-^E-^"] = e^H^ 



(2.40) 



Though we did all of this using the adjoint representation we have seen before, this struc- 
ture is the same in any representation, and therefore everything we have said is valid 
in any D„. We worked in the adjoint simply because that makes the results easiest to 
obtain. The extensive use we made of labeling the eigenvectors with the generators can 
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only be done in the adjoint representation because only in the adjoint does the number of 
eigenvectors equal the number of eigenstates. However, this will not be a problem. The 



important results from this section are (2.36 1 and (2.40 1, which are true in any representa- 
tion. Part of what we will do later is find these structures in other representations. 

The importance of the ideas in this section cannot be stressed enough. However, the 
material is somewhat abstract. So, we consider a few examples of how all this works. 



2.2.13 Adjoint Representation of SU{2) 



We now illustrate what we did in section 2.2.12 with SU (2). We will work in the adjoint 



representation to make the correspondence with section 2.2.12 as transparent as possible. 



SU (2) has 3 generators, and therefore the adjoint representation will consist of 3 x 3 ma- 
trices. This is simply the j = 1 representation, which we wrote out in equation ( 2.27[ ). 



First, it is easy to verify that (2.28 1 and ( 2.31[ > hold for k = 2. 

Next we look at the eigenstates. We know they will be the normal vectors 



V2 



V3 



(we will relabel them to be consistent with section 2.2.12 shortly). 
Obviously only Jf is diagonal, so SU (2) has rank M = 1. We define 



= Jf 

































1 
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Because the rank is 1, the root vectors will be 1-dimensional vectors, or scalars. We find 
them easily by finding the eigenvalues of each eigenvector with H^: 

So the root vectors are 



tl = tl = +1 t2=t2=0 h 

We can graph these on the real line as shown below. 



-1 



(2.41) 
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Now our initial guess will be to associate with Jf = H^, and then vi = and f 2 = E"^. 
But we want to exploit what we learned in section 2.2.12 and therefore we must make 
sure that ( |2.36| > and ( |2.40| > hold. 



Starting with (2.36 1, we check (leaving the tedious matrix multiplication up to you) 



[H\E'] 
[H\E'] 




(2.42) 



(2.43) 



But we have a problem. According to (2.361, [H^, E'^] should be proportional to E^, but 
this is not the case here. However notice that in ( 2.42| >, 




and in ( |Z43| , 



lE' 




-lE^ 
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Writing this more suggestively, 

[H\E^] = iE\ [H\iE^] = E^ 



(2.44) 



So, if we take the linear combinations of equations (2.441, we get [H^.aE^ ± jSiE"^] = 
PE^ ± aiE"^, which has the correct form of equation (2.36 1 as long as a = (3. Therefore we 
are now working with the operators E'^ = a{E^ ± iE'^). 



Now we seek to impose (2.40 ). We start by evaluating 

[E\E-] 



a'^[E^ +iE'^,E^ -iE^] 
a^{[E\ E'] - i[E\ E^] + z[E\ E^] + [E^, E^]) 
-2ia^[E\E^] = ■■■ 



Then, from equations (2.411 and the definition of E"^, we see that ±e\ = ±(ti — ^2) = 
±(1 — 0) = ±1. So we therefore set «^ = | ^ a = and we find that the appropriate 
non-Cartan generators (including the 1 to be consistent with the notation in section 2.2.12[ > 



are 



E^' = -^(E'±iE^) 
V2 



(2.45) 



which is exactly what we had in equation ( 2.18[ > above. So, we have derived the trick used 
to understand quantum mechanical spin in introductory quantum courses! 



2.2.14 SU{2) for Arbitrary j ... Again 



Now that we have our operators in the adjoint representation, we can consider any arbi- 
trary representation. As we saw in section [2.2.11[ we can form the linear combinations in 
equation (2.45 1 for any j = integer or half integer. The weight vectors will always look like 



those in the diagram on page 58 (in other words, raising and lowering operators always 



raise or lower their eigenvalue by 1). 
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The space of physical states, on the other hand, changes for each representation. For 
3 = \, we have 



1/2 



C) 



-1/2 



For j = 1, 



C) 
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For J = 3/2, 



() 



3/2 



1/2 



-1/2 



-3/2 



and so on. 



Notice that the vectors graphed in the diagram on page 58 are the exact vectors required 
to move from point to point in each of these graphs. This is obviously not a coincidence. 



2.2.15 SU{3) 



Now that we have said pretty much everything we can about SU{2), which is only Rank 1 
(and therefore not all that interesting), we move on to SU (3). However, we will expedite 
the process by stating the structure constants up front. The non-zero structure constants 
are 



/: 



123 



1, /l47 — /l65 — /246 — /257 — /345 — /: 



376 



1 

2' 



/458 — /( 



678 



2 



The most convenient representation is the Fundamental Representation (consisting of 
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3x3 matrices). They are T"- = for a = 1, . . . , 8, where 
/O 1 0\ 




(2.46) 



Clearly, only two of these are diagonal, and A^. So, S'f/(3) is a rank 2 group. 



Before moving on, we summarize a few results (without proofs). An arbitrary SU (n) 
group will always have — 1 generators, and will be rank n — 1. An arbitrary SO{n) 
group (for n even) will always have generators. We won't worry about the rank of 

the orthogonal groups. 

Working in the adjoint representation of SU (3) would involve 8x8 matrices, which would 
obviously be very tedious. So, we exploit the fact that the techniques we developed in sec- 
tion 2.2.12 are valid in any representation, and stick with the Fundamental Representation 



defined by the generators in (2.46 1. 



Proceeding as in section 2.2.13 we note that the eigenvectors will again be 



V2 



V3 



(we will relabel them to be consistent with section 2.2.12 shortly). 
Then, the Cartan generators are 

'1 0' 



0-10 



^0 0^ 

and the non-Cartan Generators are simply 




^2 ^ y2 



E^ = 



E' 



So we have 6 eigenvalues to find, 

1' 



Vl, 



V2, 



2V3J 



H'v3 = (0)1.3 
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So the weight vectors will be 2-dimensional (because the rank is 2). They are 



2 2V3J 



2 2^3 y ' 



We can graph these in as shown below. 



H2 







(2.47) 



H1 



Now, repeating nearly the identical argument we started with equation (2.42 1 and repeat- 
ing it for all 6 non-Cartan generators, we find that in order to maintain (2.36) and (2.401, 
we must work with the operators 



V2 
1 



V2 
1 



-^{T'±^T') = -^{E'±^E') 

^{T'±^T') = ^{E'±^E') 
The weight vectors associated with these will be, respectively. 



(2.48) 



± (ti - h) = ± 



and ± (t2 -h) = ± 



-1/2 
V3/2 



(2.49) 



So, the non-Cartan generators are 



E 



E 



E 



(-VA 
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We are no longer in the adjoint representation, so we had to be more deliberate about 



choosing these linear combinations than we could be in section 2.2.13 What we did here is 



more general; we chose them to be the differences in the three weight vectors in equation 
(2.471, so that these vectors would naturally transform from one eigenvector to another 
(just as the raising and lowering operators do, as we found for SU{2) and more generally 
in section 2.2.12 1. The remarkable property of Lie groups is that this is always possible in 



any representation. 



We can graph the 6 vectors in ( 2.49[ >, along with the two Cartan weight vectors, which we 
know from (12.331) 0- 



H2 



H1 



And again, just as with SU{2), notice that the 6 non-zero vectors are the exact vectors 
that would be necessary to move from point to point on the diagram on page 63 So once 
again, we see that the non-Cartan generators act as raising and lowering operators which 
transform between the eigenstates of the Cartan generators. Notice that there were 6 non- 
Cartan generators, and they formed linear combinations to form 6 raising and lowering 
operators. 



2.2.16 What is the Point of All of This? 



Before finally getting back to physics, we give a spoiler of how Lie theory is used in 
physics. What we are going to find is that some physical interaction (electromagnetism, 
weak force, strong force) will ultimately be described by a Lie group in some particular 
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representation. The particles that interact with that force will be described by the eigen- 
vectors of the Cartan generators of the group, and the eigenvalues of those eigenvectors 
will be the physically measurable charges. Clearly, the number of charges associated with 
the interaction is equal to the number of dimensions of the representation. For example, 
you likely are aware that the strong force has 3 charges, called "colors" (red, green, and 
blue). So, the strong force (we will see) will be in a 3-dimensional representation of the 
group that describes it. 

We will find that all forces carrying particles (photons, gluons, W and Z bosons) will be 
described by the generators of their respective Lie group. The Cartan generators will be 
force-carrying particles which can interact with any particle charged under that group 
by transferring energy and momentum, but do not change the charge (photons and Z 
bosons). This makes sense because Cartan generators are not raising or lowering opera- 
tors. On the other hand, the non-Cartan generators will be force carrying particles which 
interact with any particle charged under that group by not only transferring energy and 
momentum, but also changing the charge {W bosons and gluons). 

We won't be able to come back to discussing how this works until much later, and until 
examples are worked out, this may not be clear. We merely wanted to give an idea of 
where we are going with this. 



2.3 References and Further Reading 



The material in section 2.1 came primarily from Ifl and pO|. The material in 2.2 
from Ha, El, and 113. The sections on SU{2) also came from II3T11 . 



came 



For further reading, we recommend 0, O, Ell, CZI, and BH. 
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3 Part III — Quantum Field Theory 



3.1 A Primer to Quantization 
3.1.1 Quantum Fields 

Our ultimate goal in the exposition that follows is to formulate a relativistic quantum me- 
chanical theory of interactions. So, beginning with the fundamental equation of quantum 
mechanics, Schroedinger's equation, 

H^< = ih— 
dt 

we know that for a non-interacting, non-relativistic particle, H = = —^'V'^, so 

Of course, ^ is in this case a Scalar Field , and therefore only has one state. So, it describes 
a spin-0 particle (or, in the language we have learned in the previous sections, it sits in a 
j = representation of SU (2), which is the trivial representation). And, since \E' does not 
have any spacetime indices, it also transforms trivially under the Lorentz group 5*0(1, 3). 

Notice, however, that we have a fundamental barrier in making a relativistic theory - the 
spatial derivative in (3.1) acts quadratically (V^), whereas the time derivative is linear. 



Clearly, treating space and time differently in this way is unacceptable for a relativistic 
theory. That is a hint of a much more fundamental problem with quantum mechanics; 
space is always treated as an operator, but time is always treated as a parameter. This 
fundamental asymmetry is what ultimately prevents a straightforward generalization to 
relativistic quantum theory. 

To fix this problem, we have two choices: either promote time to an operator along with 
space, or demote space back to a parameter and quantize in a new way. 

The first option would result in the Hermitian operators X, Y, Z, and T. It turns out that 
this approach is very difficult and less useful as far as building a relativistic quantum 
theory. So, we will take the second option. 

In demoting position to a parameter along with time, we obviously have sacrificed the 
operators which we imposed commutation relations on to get a "quantum" theory in the 
first place. And because we obviously can't impose commutation relations on parameters 
(because they are scalars), quantization appears impossible. So, we are going to have to 
make a fairly radical reinterpretation. 
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Rather than letting the coordinates be Hermitian operators that act on the state in the 
Hilbert space representing a particle, we now interpret the particle as the Hermitian operator, 
and this operator (or particle) will be parameterized by the spacetime coordinates. The 
physical state that the particle operators act on is then the vacuum itself, |0). So, whereas 
before you acted on the "electron" |\E') with the operator x, now the "electron" (parame- 
terized by x) acts on the vacuum |0), creating the state (\I'(x^)|0)). In other words, 
the operator representing an electron excites the vacuum (empty space) resulting in an 
electron. We will see that all quantum fields contain appropriate raising and lowering 
operators to do just this. 

This approach, where the quantum mechanical entities are no longer the coordinates act- 
ing on the fields, but the fields themselves, is called Quantum Field Theory (QFT). 

So, whereas before, quantization was defined by imposing commutation relations on the 
coordinate operators [x, p] ^ 0, we now quantize by imposing commutation relations on 
the field operators, [^i, ^2] 7^ 0. 

Because we must still write down the equations of motion which govern the dynamics 
of the fields, we will need to spend the rest of this section coming up with the classical 
equations governing the fields we want to work with. We will quantize them in the next 
section. 



3.1.2 Spin-0 Fields 



As we said above, Schroedinger's equation (3.1) describes the time evolution of a spin-0 



field, or a scalar field. Generalizing to higher spins will come later. Now, we see how to 
make this description relativistic. 

The most obvious guess for a relativistic form is to simply plug in the standard relativistic 
Hamiltonian 



H = v^^VTrnV (3.2) 



Note that, using the standard Taylor expansion + x^ ^ 1 + ^3; for x << 1 gives H ^ 

mc^ + for « c^, which is the standard non-relativistic form (plus a constant) we'd 
expect from a low speed limit. 



Plugging ( 3.2[ > into ( 3.1[ ), we have 



at 

But there are two problems with this: 

1. The space and time derivatives are still treated differently, so this is inadequate as a 
relativistic equation, and 
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2. Taylor expanding the square root will give an infinite number of derivatives acting 
on (p, making this theory non-local. 



One solution is to square the operator on both sides, giving 

^ M% + - = 

Or, if we choose the so called "natural units" or "God units", where c = h = 1, we have 

((9^ - m2)0 = (3.3) 



Equation (3.3 1 is called the Klein Gordon equation. It is nothing more than an operator 
version of the standard relativistic relation E"^ = rri^c^ + p^c^. 

Note that because we will be quantizing fields and not coordinates, there is absolutely 
nothing "quantum" about the Klein Gordon equation. It is, at this point, merely a rela- 
tivistic wave equation for a classical, spinless, non-interacting field. 

Finally, we note one major problem with the Klein Gordon equation. When we squared 
the Hamiltonian H = ^Jvn?c^ + "p^c^ to get = m^c^ + p^c^, the energy eigenvalues 
became E = -^^w^d^ + •pp'f?. It appears that we have a negative energy eigenvalue! Ob- 
viously this is unacceptable in a physically meaningful theory, because negative energy 
means that we don't have a true vacuum, and therefore a particle can cascade down for- 
ever, giving off an infinite amount of radiation. 

We will see that this problem plagues the spin- 1/2 particles as well, so we wait to talk 
about the solution until then. 



3.1.3 Why 5f/(2) for Spin? 

Because we are talking about particles "spinning", a common question is why don't we 
use 5*0(3) instead of SU (2)? The original answer to the question is historical. The experi- 
ments done in the early days of quantum mechanics were not consistent with the particles 
having a rotational degree of freedom in spacetime. Rather, the data indicated that, along 
any given axis, the spin could have only one of two possible values, and 5*0(3) does not 
explain this. Here, however, we consider a more mathematical explanation. 

First, recall that spin is a purely quantum mechanical phenomenon, with no classical 
analogue. Because the data demanded two possible spin states, the field describing the 

particle had to have 2 spin components, \1/ = I ^ . Now, if we seek a 2-dimensional 
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representation of 50(3), we find that there is only one: Dq © Dq, the trivial representation 
consisting of all I's. This means 



which is no transformation. This is the only 2-dimensional representation of SO (3) that 
is possible. 

The solution to the problem is found in one of the many peculiarities of quantum me- 
chanics. The only physically measurable quantity in quantum theory is the probability 

amplitude, which is proportional to the square of \1'. Therefore, the state I ^ 1 is physi- 



cally identical to 




Now consider a general element of S0{3): e'''^<t>Jx+i'Jv+^J^) _ On the other hand, a general 
element of SU{2) will be e*(<^^+'/'^+^^). 

Now consider rotating the system by an angle of 2tx around, say, the z axis. The 50(3) 
element corresponding to this rotation will be e*^'^"^^ while the SU{2) element will be 
giTTcr^ jj-^g factor of 1 /2 difference means that the spinor space rotates through only half 
the angle of the 50(3) does. So, in the 27r rotation, U G SU{2) — > —U, whereas R E 
50(3) — > R. Therefore, both U and —U correspond to R. There is a 2 to 1 correspondence 
between SU{2) and 50(3). 

And, as we said above, spin is a purely quantum mechanical effect and experimentally 
only allows 2 values, but 50(3) has no such representation, whereas the j = 1/2 rep- 
resentation of SU{2) does. We therefore use SU{2). And, because SU{2) is 2 ^ 1 with 
50(3), but spin is quantum mechanical, both U and —U can consistently correspond to 
the same R E 50(3). The minus sign difference is not subject to measurement; only |\E'p 
is physically measurable. 

An important thing to understand is that "spin" is not a rotation through spacetime in any 
meaningful way. It is a rotation in "spinor space", which is an internal degree of freedom. 
Like many things in quantum mechanics, spinor space is a mathematical structure. All 
we can say for certain is what we can measure, or know (|\E'P), not what "is". 



3.1.4 Spin I Particles 



Finding equation ( 3.3[ > was easy because scalar fields have no spacetime indices and no 
spinor indices, and they therefore transform trivially under SU{2) and the Lorentz group. 

A particle of spin 1/2 however, will have two complex components, one for spin +1/2, 



69 



and the other for spin —1/2. So, we describe such a particle as the two-component Spinor 



^1 

^2 



where tpi and iJj2 are both G C. So, we want some differential operator in the form of 2 x 2 
matrices to act on such a field to form the equation of motion. 

Following Dirac's approach, he reasoned that given such a 2 x 2 operator, the equation of 
motion should somehow "imply" the Klein Gordon equation (which merely makes the 
theory relativistic). So his goal (and our goal) is to find an equation with a 2 x 2 matrix 
differential operator acting on ip that results in ( 33| |. 



Dirac's approach was to find an operator of the form 

P = Yd, = ^do + j'd, + + t'^s 

where the 7's are 2x2 matrices, and the equation of motion is then pij) = —imtjj. The 
challenge is in finding the appropriate 2x27 matrices. Dirac reasoned that, in order to 
be properly relativistic, operating twice with p should give the Klein Gordon equation. 
In other words, 

p = —imip =^ ppip = —impip 

=^ 7^<9^7'^<9vV' = —im{—imip) 

^ {YYd>.d, + m^)ij = (3.4) 



This wi ll yi eld the Klein Gordon equation if 7/^7'' = —rj^^'^I. Or, using the symmetry of the 
sum in (3.4 1, it will yield the Klein Gordon equation if we demand ^(7'^7''+7'^7^) = —rj^^l. 



Consider 



If the 7 matrices satisfy (3.5), then (3.4) gives 



which is exactly the Klein Gordon equation (3.3). 
So, we have the Dirac equation 

[p + im)il) = 



(3.5) 



(3.6) 



but we still have a problem. Namely, there does not exist a set of 2 x 2 matrices that solve 
( |3.5[ ). Nor does there exist a set of 3 x 3 matrices. The smallest possible size where this is 
possible is 4 X 4. Obviously, if we want to describe a spin-1/2 particle with exactly 2 spin 



70 



states, using 4 spin components does not seem right. But, we will accept the necessity of 
4x4 Dirac matrices and move on. 

Instead of using ip = (^^^ > ^i^^ define the two 2-dimensional spinors 



and = 



^3 
^4 



and the 4-component spinor 



(3.7) 



(3.8) 



Now it is possible to solve (3.5 1. Such a problem is actually very familiar to algebraists, 
and we will not delve into the details of how this is done. Instead, we merely state one 
solution (there are many, up to a similarity transformation). We define the 4x4 matrices 



-o' 




and 



7 









a 



(3.9) 



where a" is the 2x2 identity matrix, and a* are the Pauli spin matrices. It should be no 
surprise that they show up in attempting to describe spin- 1/2 particles. What is interest- 
ing is that we did not assume them — we derived them using ( |3.5[ >. 

Before moving on, notice that we have initiated a convention that will be used throughout 
the rest of these notes. Whenever a greek index is used, it runs over all spacetime indices. 



Whenever a latin index is used, it runs over only the spatial part. So in (3.9 1, % runs 1,2,3. 



Now that we have an explicit form of the Dirac gamma matrices, we can write out (3.6 1 
explicitly: 



/ 


\di + 182 do - 83 
Or, in terms of ipL and ^/jr, 



do - 63 
-di- 182 





-di + id2\ 
do + ^3 







where we have defined the 4-vectors = 



+mipL 









^2 


= —im 
















and 







-a- 



3.1.5 The Lorentz Group 

This section is intended to give a deeper imderstanding of why we were unable to find a 



2x2 matrix representation to solve (3.5) 
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Recall that the driving idea behind the derivation of the Dirac equation (3.6) was to make 
it imply the Klein Gordon equation, or in other words to be a relativistic theory. Put 
another way, it was to create a theory that was invariant under the Lorentz group SO (1,3). 
So, let's take a closer look at the Lorentz group. 



We know from section 1.1.5 that the Lorentz group consists of 3 rotations and 3 boosts. 



We gave the general forms of these transformations in equations (1.5 1 and (1.6 1. It is 



easy, using those general expressions in addition to (2.7 1, to find all 6 generators, and 
then multiply them out to get the commutation relations. We spare the (easy but tedious) 
details and simply state the commutation relations. If we label the generators of rotation 
J* (i = 1, 2, 3), and the generators of boosts K' {i = 1, 2, 3), then the commutation relations 
are 

[J\J^] = ie^^^J^ 
[J\K^] = ie'^^K^ 
[K\K^] = -le'^^J^ 



In order to make the actual structure of this group more obvious, we define two new 
linear combinations of these generators: 

N' = ]^{f-iK') N'^ = ]^{f + iK') 

Writing out the commutation relations for A^* and A^*^, we get 

[N\N^^] = 

So, both A^* and A^*^ separately form an SU (2). In more mathematical terms, we say that 
S0(l,3) is Isomorphic to SU{2) ® SU{2), which we denote 50(1,3) ^ SU{2) ® SU{2). 
While the idea of an isomorphism is a very rich mathematical idea, for now you can 
simply think of it as a way of saying that two groups have the same group structure. 

So, because a given representation of SU{2) is defined by the value of j, we can see that 
a particular representation of the Lorentz group 50(1,3) = SU{2) SU{2) is defined 
by two values of j, or by the doublet (j, j')- Th^ smallest possible representation then is 
(j? j') = (0, 0). This has one state from j = and one state from j' = 0, and therefore has 
1x1 = 1 state total. Therefore, this representation describes a scalar field. 

Then, there is the state (0, 1/2), which will have one state from j = 0, but two states from 
j = 1/2, for a total of 1 x 2 = 2 states. Therefore, this describes a single spin-1/2 field. We 
call this field i/jl, and the (0, 1/2) representation the Left-Handed Spinor Representation 
of the Lorentz group. 
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Clearly, we will also have the representation (1/2, 0), which also has 2 states, correspond- 
ing to the 4'r field. This is called the Right-Handed Spinor Representation of the Lorentz 
group. This is the reason for the notation used in (3.7). The left-handed (0, 1/2) represen- 
tation acts onipL and the right-handed (1/2, 0) representation acts on ipn. 



Next is the representation (1/2,1/2), which has two states from j = 1/2 and two from the 
f = 1/2 for a total of 2 x 2 = 4 states. It turns out that this representation is the space- 
time vector representation we use to act on spacetime vectors for the standard Lorentz 
transformations discussed in section [l.l.5[ 



Now, an SU (2) representation specified by some j is an irreducible representation, and 
therefore the tensor products SU (2) ® SU (2) specified by the doublet (j, j') are irreducible. 
This means that there are no irreducible subspaces, and so given a representation 
there is a particular transformation taking the state to For the (0,0) and 

(1/2, 1/2) representations this doesn't affect anything. However, this fact means that the 
(0, 1/2) and (1/2,0) representations must always appear together. To put this in more 
mathematical language, our choices for representations of the Lorentz group are 



(0,0), (1/2,1/2) and (1/2,0)0(0,1/2) 



which are 1, 4, and 4-dimensional representations, respectively. Furthermore, they are 
the representations which transform Klein Gordon scalar/ spinor-0 fields, spacetime 4- 
vectors, and spin- 1/2 spinor s, respectively. 

The physical meaning of this fact is that relativity demands that if you want a theory 
with spin-1/2 particles, you cannot have them existing by themselves. They must come 
in pairs, each transforming under an SU{2) representation of opposite handedness. In 
the next two sections we will discuss ways of interpreting this fact, starting with Dirac's 
original approach which, while brilliant, didn't ultimately work. Then we will consider 
what appears to be the correct view. 



3.1.6 The Dirac Sea Interpretation of Antiparticles 



Initially, it may seem that the impossibility of finding a 2 x 2 matrix solution to (3.5 1 
means that we can't have fields with 2 spinor states. However, we saw in the last section 
that we aren't limited to scalars and spacetime 4-component spinors. We can also have 
two fields, ipL and ipji, which can be paired together to form two spin-1/2 fields in a 4- 

component spinor ip = (^^^ ■ Dirac was faced with the challenge of both interpreting 

this, while at the same time dealing with the negative energy states mentioned in section 
I3T21 

Dirac's solution, though today abandoned, was brilliant enough to mention. He sug- 
gested that because spin-1/2 particles obey the Pauli Exclusion Principle, there could be 
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an infinite number of particles already in the negative energy levels, and so they are al- 
ready occupied, preventing any more particles from falling down and giving off infinite 
energy Thus, the negative energy problem was solved. 

Furthermore, he said that it is possible for one of the particles in this infinite negative sea 
to be excited and jump up into a positive energy state, leaving behind a hole. This would 
appear to us, experimentally, as a particle with the same mass, but the opposite charge. 
He called such particles Antiparticles. For example, the antiparticle of the electron is the 
antielectron, or the positron (same mass, opposite charge). The positron is not a particle 
in the same sense as the electron, but rather is a hole in an infinite sea of electrons. And 
where this negative charge is missing, all that is left is a hole which appears as a positively 
charged particle. 

So, ipL describes a particle, and due to the infinite sea of negative particles, there can 
always be a hole, which will be described by tpL- Everything about this worked out math- 
ematically, and when antiparticles were detected about 5 years after Dirac's prediction of 
them, it appeared that Dirac's suggestion was correct. 

However, there were two major problems with Dirac's idea, and they ultimately proved 
fatal to the "Dirac Sea" interpretation: 

1. This theory, which was supposed to be a theory of single particles, now requires an 
infinite number of them. 

2. Particles like photons, pions, mesons, or Klein-Gordon scalars don't obey the Pauli 
Exclusion Principle, but still have negative energy states, and therefore Dirac's ar- 
gument doesn't work. 

However, his labeling them "antiparticles" has stuck, and we therefore still refer to the 
right-handed part of the spin- 1/2 field as the antiparticle, whereas the left-handed part is 
still the particle. 

For these reasons, we must have some other way of understanding the existence of the 
antiparticles. 



3.1.7 The QFT Interpretation of Antiparticles 



In presenting the problem of negative energy states, we have been somewhat intention- 



ally sloppy. To take stock, we have two equations of motion: the Klein Gordon (3.3) for 



scalar/ sptn-0 fields, and the Dirac equation (3.6 1 for spin-1/2 particles. 



And in our discussion of negative energy states, we were "pretending" that the tp's and 



')'s are "states" with negative energy. But, as we said in section 3.1.1 QFT offers a different 



interpretation of the fields. Namely, the fields are not states — they are operators. And 
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consequently they can't have energy. A state is made by acting on the vacuum with either 
of the operators (p or tp, and then the state 0|O) or ijj\0) has some energy. 

So, QFT allows us to see the antiparticle as a real, actual particle, rather than the absence 
of a particle. And, we do not need the conceptually difficult idea of an infinite sea of 
negative energy particles. The vacuum |0), with no particles in it, is now our state with 
the lowest possible energy level. And, as we will see, there are never negative energy 
states with these particles. 

How exactly |0) works will become clearer when we quantize. The point to be understood 
for now is that QFT solves the problem of negative energy by reinterpreting what is a state 
and what is an operator. The fields </) and are operators, not states, and therefore they 
do not have energy associated with them (any more than the operator x or did in non- 
relativistic quantum mechanics). So, without any problems of negative energy, we merely 
accept that nature, due to relativity, demands that particles come in particle/ antiparticle 
pairs, and we move on. 



3.1.8 Lagrangians for Scalars and Dirac Particles 



Now that we have the equations of motion (3.3 1 and (3.6 1, we want to know the actions 
that lead to these equations of motion. In order to save time, we will merely write down 
the answers and let you take the variations to see that they do indeed lead to the Klein 
Gordon and Dirac equations of motion for cp and ^p. 

They are 



^KG 



-^dyd,<p 



-rn^cp 



iip\a^d^,ipL + iip^RCT^d^ipR - m{ip\ipR + ip'^^ipi 



(3.10) 
(3.11) 



where the dagger represents the Hermitian conjugate, ip\ = (^*,?/'2)' '^r = (^35 ^^4)/ 
usual. You can actually take the variation of Co with respect to either ip'l and ip]^ to get 
the equations of motion for ipL and ^p]i, or you can take the variations with respect to 
ipL and ipR to get the equations for ip]^ and ipR- The two sets of equations are simply the 
conjugates of each other, and therefore represent a single set of equations. 



In order to simplify (3.11), the convention is to use the Dirac gamma matrices (3.9 1 to 
define tp = tp'^'j'^ (where 4' here is the 4-component spinor in equation (3.8 1). Using this, 
all 4 terms in ( 3.11| > can be summarized as 



C 



D 



ip{i'~^^d^ — m)ip 



(3.12) 
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3.1.9 Conserved Currents 



In Part I we discussed how symmetries and conserved quantities are related. Let's con- 
sider a few examples of this using the Lagrangians we have now defined. 

Consider a massless Klein Gordon scalar particle, described by £ = —^d'^cpd^^cj). Following 
what we did starting with equation ( 1.2[ >, consider the transformation (p ^ (f) + e, where 



e is a constant. Because d^(j) d^cj) + d^e = d^cj), the Lagrangian is invariant. So (using 
5(t) = 1), our conserved quantity is 

dr. 

Or, consider the Klein Gordon Lagrangian with complex scalar fields (p and which 
we write as £ = —d^cjy^cjy^cj) — m?(j)^(l). We can make the transformation cp e*"0 and 
(j)'^ — > 0l^e"*" (where a is an arbitrary real constant). This type of transformation is called a 
f/(l) transformation, because e*" is an element of the group of all 1 x 1 unitary matrices. 



as discussed in section 2.2.1 



The conserved quantity associated with this U{1) symmetry is 

dC dC 



Or consider the Dirac Lagrangian. Notice that it is invariant under the U{1) transforma- 
tion 'ip e*", with current 

f = (3.13) 

In both of the previous examples, notice that the U{1) symmetry changes the field at all 
points in space at once, and all in the same way. In other words, it is a single overall con- 
stant phase e*". We therefore call such a symmetry a Global Symmetry. The implications 
of this are likely not clear at this point. We merely wish to call your attention to the fact 
that e*" has no spacetime dependence. 



3.1.10 The Dirac Equation with an Electromagnetic Field 



Previously we found the Lagrangian for an electromagnetic field (1.131. Our goal now is 
to find a Lagrangian that describes the electromagnetic field and a spin-1/2 particle that 
couples to the electromagnetic field, and additionally the interaction between them. We 
start by writing down a Lagrangian without any interaction. This will simply be the sum 
of the two terms, 

C = Cd + Cem = i'iirO, - m)^ - ^F,,F>^'' - J^A^, (3.14) 
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But, because the Dirac part has no terms in common with the electromagnetic part, the 
equations of motion and the conserved quantities for both ^ and will be exactly the 
same, as if the other weren't present at all. In other words, both fields go about their way 
as if the other weren't there — there is no interaction in this theory Because this makes for 
a boring universe (and horrible phenomenology), we need to find some way of coupling 
the two fields together to produce some sort of interaction. 

Interaction is added to a physical theory by adding another term to the Lagrangian called 
the Interaction Term. So, the final Lagrangian will have the form C = Cd + ^em + ^int- 

Now, for reasons that will become clear in the next section (and even more clear when 
we quantize), we do this by coupling the electromagnetic field to the current resulting 



from the U{1) symmetry in Cjj, which we discussed in section 3.1.9 and wrote out in 



equation ( |3.13[ >. In other words, our interaction term will be proportional to A^j^ 

So, adding a constant of proportionality q (which we will see has the physical interpreta- 
tion of a coupling constant, weighting the probability of an interaction to take place, or 
equivalently the physical interpretation of electric charge), our Lagrangian is now 

C = ij{zYd,-m)ij-^F,,F^''-J^'A,~qfA, 

= tlj{trd^-m)tlj-^F^,F^'' -{J>^ + qijY^)A^ (3.15) 

Notice that C is still invariant under the global U{1) symmetry, and the U{1) current is 
still = tp'jfip. 



Also, notice that the Lagrangians in (3.14) and (3.15) are the same except for a shift in the 
current term, J'^ — > J'^ + qj^. Recall that physically, = (p, J) represents the charge 
and current creating the field. The fact that has shifted in ( 3.15[ ) simply means that the 



sptn-1/2 particle in this theory contributes to the field, which is exactly what we would 
expect it to do. 

If we set q = e, the electric charge, this Lagrangian becomes upon quantization the La- 
grangian of Quantum Electrodynamics (QED), which to date makes the most accurate 
experimental predictions ever. 

In the next section, we will re-derive this Lagrangian in a more fundamental way. 



3.1.11 Gauging the Symmetry 



Physically speaking, this section is among the most important in these notes. Read this 
section again and again until you understand every step. 



it is invariant 



Consider once again the Dirac Lagrangian ( 3.6[ ). As we said in section 3.1.9 
under the global U{1) transformation -ip — ^ e^°'-ip. It is global in that it acts on the field the 
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exact same way at every point in spacetime. The idea behind this section is that we are 
going to make this symmetry Local, so that a depends on spacetime {a = a(x^^)), and then 
try to force the Lagrangian to maintain its invariance under the local U{1) transformation. 
Making a global symmetry local is referred to as Gauging the symmetry. 

We start by making the local U{1) transformation: 

and because the differential operators will now act on a{x) as well as ip, we get extra 
terms: 

— '0(i7''9^ — m — ^^d,j,a{x))'^ 

If we want to demand that C still be invariant under this local U{1) transformation, we 
must find a way of canceling the 'ijjj'^ijjdij,a{x) term. We do this in the following way. 

Define some arbitrary field which under the U (1) transformation e*"*^^^ transforms ac- 
cording to 

A^^A^- ^d^a{x) (3.16) 

We call Afj_ the Gauge Field for reasons that will be clear soon, and g is a constant we have 
included for later convenience. 

We introduce by replacing the standard derivative 9^ with the Covariant Derivative 

D^ = d^ + iqA^ (3.17) 

If you have studied general relativity or differential geometry at any point, you are famil- 
iar with covariant derivatives. There is an incredibly rich geometric picture of all of this, 
but it is beyond the scope of these notes. We will deal with it later in this series, however. 

As a comment regarding vocabulary, to say that a particle "carries charge" mathemati- 
cally means that it has the corresponding term in its covariant derivative. So, if a parti- 
cle's covariant derivative is equal to the normal differential operator d^, then the particle 
has no charge, and it will not interact with anything. But if it carries charge, it will have a 
term corresponding to that charge in its covariant derivative. This will become clearer as 
we proceed. 

So, our Lagrangian is now 

C = i^ii-f^D^ - 171)1/; = i^in^idi, + igAJ - 171)1/; = -0(^7^9^ - m - ^7"^^)^' 
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And under the local U{1) we have 

= ^{h^d^ - m - Yd^,a{x) - q^^^ + Yd^a{x))i) 
= ijj{i'j^d^ — m — q'j^A^)^ = ijj^i'j'^D^ — m)^ 
= C 

So, the addition of the field has indeed restored the f/(l) symmetry. Notice that now it 
is not only invariant under this local U{\), but also still under the global U (1) we started 
with, with the same conserved U{1) current = ipj^tp. This allows us to rewrite the 
Lagrangian as 

£ = t/jii^D^ - m)^p = tpii^d^ - m)i) - qfA^ (3.18) 



But we have a problem. If we want to know what the dynamics of A^ will be, we nat- 
urally take the variation of the Lagrangian with respect to A^. But because there are 
no derivatives of A^, the Euler-Lagrange equation is merely = —qijj'y'^'ijj = 0. But 
—qtjj'j'^tp = —qj^. So the equation of motion for A^ says that the current vanishes, or that 



= 0, and so the Lagrangian is reduced back to (3.121, which was not invariant under 
the local f/(l). 

We can state this problem in another way. All physical fields have some sort of dynamics. 
If they don't then they are merely a constant background field that never changes and 



does nothing. As it is written, equation (3.18) has a field A^ but A^ has no kinetic term, 
and therefore no dynamics. 



So, to fix this problem we must include some sort of dynamics, or kinetic terms, for A 



^J■■ 



The way to do this turns out to involve a considerable amount of geometry which would 
be out of place in these notes. We will cover the necessary ideas in a later paper in this 
series and derive the following expressions. For now we merely give the results and ask 
you for patience until we have the machinery to derive them. 

For an arbitrary field A^, the appropriate gauge-invariant kinetic term is 
where 

i^M^ = ImM /^^'l (3.19) 
Q 

and q is the constant of proportionality introduced in the transformation of A^ in equation 



(3.161. is the covariant derivative defined in (3.17). 
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Writing out (3.191 (and using an arbitrary test function f{x)), 



= [(9^ + iqA^") {d"" + iqA") - {d"" + iqA^) {d^" + iqA^"] f{x) 

= [^^'^''f{x) + + iqA^'d" f{x) - q^A^'A''f{x) 

-d'^d^'fix) + tqd''{A^'f{x)) + iqA'^d^'fix) - q^A-'A^'fix)] 
= - \iqf(x)d''A'' + iqA'^d^'fix) + iqA^'d'^fix) - q^A^'A^fix) 

q 

-iqf{x)d''A'' - iqA^'d'^fix) - iqA'^d^'fix) + 
= [d^'A'' -d'A^" + iq[A'',A'']]f{x) 

But for each value of /i, A^ is a scalar function, so the commutator term vanishes, leaving 
(dropping the test function f{x)) 

p^^v = -10", D"] = d^A" - d^A" (3.20) 



So, writing out the entire Lagrangian we have 

1 
4 



And finally, because A^ is obviously a physical field, we can naturally assume that there 
is some source term causing it, which we simply call J^. This makes our final Lagrangian 
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Comparing this to (3.151 we see that they are exactly the same. So what have we done? 
We started with nothing but a Lagrangian for a spin- 1/2 particle, which had a global U{1) 
symmetry. Then, all we did was promote the U{1) symmetry to a local symmetry (we 
gauged the symmetry), and then imposed what we had to impose to get a consistent 
theory. The gauge field A^ was forced upon us, and the form of the kinetic term for A^ is 
demanded automatically by geometric considerations we did not delve into. 

In other words, we started with nothing but a non-interacting particle, and by specify- 
ing nothing but U{1) we have created a theory with not only that same particle, but also 
electromagnetism. The field, which upon quantization will be the photon, is a direct 
consequence of the U (1). 



This is what we meant at the end of section 2.2.11 when we said that electromagnetism is 



described by U{1). We will talk more about the weak and strong forces later, as well as 
the groups that give rise to them. 
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Theories of this type, where we generate forces by specifying a Lie group, are called 
Gauge Theories, or Yang-Mills Theories. 



Finally, notice that (3.161 has exactly the same form as (1.151. This is why we call a 
gauge field. The gauge symmetry in electromagnetism is a sort of remnant of the much 
deeper and more fundamental U{1) structure of the theory. 



3.2 Quantization 



3.2.1 Review of What Quantization Means 



In quantum mechanics (not QFT), quantization is done by taking certain dynamical quan- 
tities and making use of the Heisenberg Uncertainty Principle. Normally we take posi- 
tion X and momentum p and, according to Heisenberg, the measurement of the particle's 
position will effect its momentum and vice-versa. 

To make this more precise, we promote x and p from merely being variables to being 
Hermitian operators x and p (which can be represented by matrices) acting on some vector 
space. Calling a vector in this space l^jj), physically measurable quantities (like position 
or momentum) become the eigenvalues of the operators x and p, 

x\ip) = x\ip) 
p\^) =p\^) 



Heisenberg Uncertainty says that measuring x will affect the value of p, and vice-versa. 
It is the act of measuring which enacts this effect. It is not an engineering problem in 
the sense that there is no better measurement technique which would undo this. It is a 
fundamental fact of quantum mechanics (and therefore the universe) that measurement 
of one variable affects another. 

So, if we measure x (using x) and then p (using p), we will in general get different values 
for both than if we measured p and then x. More mathematically, xp ^ px. Put another 
way, 

[x,p] = xp — px 

For reasons learned in an introductory quantum course, the actual relation is 

[x,p] = ih (3.21) 



where h is Planck's Constant. We call (3.21 ) the Canonical Commutation Relation, and it 



is this structure which allows us to determine the physical structure of the theory. 

More generally, we choose some set of operators that all commute with each other, and 
then label a physical state by its eigenvectors. For example x, y and z all commute with 
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each other, so we may label a physical state by its eigenvectors I?/',,) = \x,y,z). Or, 
because p^, Py> and all commute, we may call the state \ipp) = \px,Py,Pz)- We may 
also include some other values like spin and angular momentum, to have (for example) 

1^) = \x,y,z,S;„L^, . . .). 



As discussed in section 3.1.1 when we make the jump to QFT, the fields are no longer the 
states but the operators. We are therefore going to impose commutation relations on the 
fields, not on the coordinates. 

Furthermore, whereas before the states were eigenvectors of the coordinate operators, we 
now will expand the fields in terms of the eigenvectors of the Hamiltonian. 



3.2.2 Canonical Quantization of Scalar Fields 



We begin with the Klein Gordon Lagrangian in equation (3.10), but we make the slight 
modification of adding an arbitrary constant fi, 

CKG = -ld''<l>d^<l>-lm^<l>^ + ^ 
Note that Vl has absolutely no affect whatsoever on the physics. 

Quantization then comes about by defining the field momentum and Hamiltonian (using 
( |L7l > and (|L8|)) to get 



djC 

0(p{X) 

n = n0-/: = -n2 + i(V(/))2 + -mV-^ (3.23) 

2 2 2 



Now, using the canonical commutation relations (3.21 1 as guides, we impose 



[(Pit, x), (Pit', x')] =0 

[nit, x), nit', x)] = 

[(Pit, x), nit', x')] = i5it - t')5ix - x') (3.24) 



(where we have set h = \). 



We can see more clearly what this means if we expand the solutions of the Klein Gordon 
equation. One solution is plane waves, e*'^'^ where 

UJ = (3.25) 

and k is the standard wave vector. 
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So, we write the field cj) as 

where f{x) is a redundant function which we have included for later convenience. For 
now, both a{k) and b{k) are merely arbitrary coefficients (integration constants) used to 
expand x) in terms of individual solutions. 

We demand that x) be Hermitian. This requires 

Then, changing the sign of the integration variable k on the second term in the integral 
allows us to use 4-vector notation, so 



[X 



d'^k - 

-j^^[a{k)e^'^- + a\k)e-^'-] 



where k ■ x = k^^x^. 



Now notice that the integration measure, d^k, is not invariant under Lorentz transforma- 
tions (because it integrates over the spatial part but not over the time part). We therefore 
choose fiji) to restore Lorentz invariance. 

We know that the measure d^k would be invariant, as would 5 functions and 6 (step) 
functions. So, consider the invariant combination 

d^M(A;2 + m2)e(A;°) (3.26) 

The 5 function merely requires that relativity hold (A;^ + m? is simply the relativistic rela- 
tion ( 3!2] >, and the 6 function preserves causality. So this is a physically acceptable Lorentz 



mvariant mtegration measure. 
Recall the general 5 function identity, 

dx5{g{x)) = J2 



dx 

where the x/s are the zeros of the function g{x). We can do the A;° integral over measure 



(3.261, and using the fact that the zeros of k"^ + w? = k"^ — k^ko + rn? in terms of k^ are 

P/co = k'^ + = u? , we get 

d'kdk'>6{k'' + m')e{k^) = J ^ 

So, adding a factor of (27r)^ for later convenience, we take our invariant measure to be 

d^k 



(27r)32u; 
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So finally, 

= J dk[a{k)e'^-^ + a^^e-'^-""] (3.27) 



where we have defined dk = 



(27r)32c<;- 



The commutation relations we defined in (3.241 will now hold provided we impose 

[a{k),a{k')] = 
[a\k),a\k')] =0 

[a{k),a\k')] = {27T)^2ujS^{k - k') (3.28) 

(showing this is fairly tedious, but we encourage you to work it out). We are using f 
instead of * to emphasize that, in the quantum theory, we are talking about Hermitian 
operators. The operators a{k) and a)(k) are scalars, so in this case a* = a^. 

Furthermore, we can write the Hamiltonian H in terms of ( 3.27| ): 

= \j dkdk'd'^x[{-iua{k)e'^-'' + iua\k)e-'^^^^ 

+ {ika{k)e'^-'' - ika\k)e-'''-'') ■ {ik' a{k')e'^'-'' - ik' a*{k')e~'^'-'') 
m2(a(^)e*^' + a\k)e-'''-''){a{k')e'^'-'' + a*{k')e-'^'-'')] - j d^xVt 

dkdk'd'^x[i-uu'a{k)a{k')e'^''+'''>'' + ujuj' a{k)a\k')e'^^-^'^-'' 

+uuo'a''{k)a{k')e~'^^~^'^- - uuo'a\k)a\k')e~'^''+^'^-'') 
+ {-k ■ fc'a(fc)a(F)e*('=+'''>" + k ■ k'a{k)a*{k')e'^''~'''>'' 

+k ■ k'a\k)a{k')e-''^^~^'^-'' - k ■ k' a\k)a\k')e-'^^+^'^-'') 
W(a(fc)a(fc')e*^''+^''^" + a{k)a\k')e'^^-^"^-'' 

~VVl 

where V is the volume of the space resulting from the / d?x integral. Then, from the fact 
that / fc"-^ = (27r)353(^), wehave 

H = ]^{2T:f j dkdk'[5''{k - k'){uu' + k-k' + m2)(a'^(fc)a(F)e-*("-"')* + a{k)a*{k')e-'^^~^'^') 

+5^{k + k'){-ujuj' -k-k' + m^){a{k)a{k')e-'^^+^'^' + a'{k)a*{k')e'^''+^'^') 

-vn 

= - [ dk—[{u;^ + P + m^){a*{k)a{k) + a{k)a\k)) 
2 J 2ui 

+ {-uj^ + ¥ + m''){a{k)a{-k)e-'^'^' + a\k)a\-k)e^''^')] - VVL 
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and finally, using the definition of uj (equation ( 3.25[ >), this becomes 

H =^ jdk Lj{a*{k)a{k) + a{k)a*{k)) - VQ 



And now, using (3.281, we can rewrite this as (switching from t^t to f to emphasize the 
Hermitian nature) 



H 



dk uj{a\k)a{k) + a{k)a\k)) - Vn 

dk uj{a\k)a{k) + {2TTf2uj6^{k -k) + a){k)a{k)) - VVt 

dk uoa\k)a{k) + j dk iu{27rfS%0) - Vn 

d^k 



dkua' {k)a{k) + 



(27r)32cj 



uj{2TTy6%o) - vn 



dkuja\k)a{k) + -5^(0) / d^k - Vn 



Notice that both the second and third terms are infinite (assuming the volume V of the 
space we are in is infinite). This may be troubling, but remember that Vt is an arbitrary 
constant we can set to be anything we want. So, let's define 



leaving 



H 



dk uja^{k)a{k) 



(3.29) 



Remember that measurement can only detect changes in energy, and therefore the infin- 
ity we subtracted off does not affect the value we will measure experimentally. What 
we have done here, by subtracting off the infinite part in a way that doesn't change the 
physics, is a very primitive example of Renormalization. Often, for various reasons, 
measurable quantities in QFT are plagued by different types of infinities. However, it is 
possible to subtract off those infinities in a well-defined way, leaving a finite part. It turns 
out that this finite part is the correct value seen in nature. The reasons for this are very 
deep, and we will not discuss them (or general renormalization theory) in much depth in 
these notes. For correlating theoretical results with experiment, being able to renormalize 
results correctly is vital. However, our goal is not to understand the subtleties of renor- 
malization, but to understand the overall structure of particle physics. When you take a 
course on QFT you will spend a great, great deal of time on renormalization, and a deeper 
understanding of it will emerge. 
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So, we have our field expansion ( 3.27[ > and commutation relations (3.281. Notice that 
( |3.28[ > have the exact form of a simple harmonic oscillator, which you learned about in 
introductory quantum mechanics. Therefore, because they have the same structure as 
the harmonic oscillator, they will have the same physics. By doing nothing but imposing 
relativity, we have found that scalar fields, which are Hermitian operators, act as raising 
and lowering (or synonymously creation and annihilation) operators on the vacuum (just 
like the simple harmonic oscillator). 



Comparing (3.281 with the standard harmonic oscillator operators, it is clear that a'^{k) 
creates a (p particle with momentum k and energy to, whereas a{k) annihilates a (p particle 
with momentum k and energy u. A normalized state will be 



\k) = V2uja\k)\0) 



(3.30) 



The entire spectrum of states can be studied by acting on |0) with creation operators, and 
probability amplitudes for one state to be found in another, {kf\ki), are straightforward 
to calculate (and positive semi-definite). Naturally this theory does not discuss any inter- 
actions between particles, and therefore we will have to do a great deal of modification 
before we are done. But this simple exercise of merely imposing the standard commu- 



tation relations (3.241 between the field and its momentum, we have gained complete 
knowledge of the quantum mechanical states of the theory. 



3.2.3 The Spin-Statistics Theorem 



Notice that the states coming from (3.30 1 will include the two particle state 



\k;k') =2ViJ^ a^{k)a^k')\0) 



(3.31) 



But the commutation relations (3.28) tell us that a'^{k)a'^(k') = {k')a^ (k). So, this theory 
also allows the state 



\k';k) = 2Vi^a\k')a\k)\0) 



(3.32) 



Recall from a chemistry or modem physics course that particles with half-integer spin 
obey the Pauli Exclusion Principle, whereas particles of integer spin do not. Our Klein 
Gordon scalar fields are spinless (j = 0), and therefore we would expect that they do not 
obey Pauli exclusion. The fact that our commutation relations have allowed both states 
( |3.31[ > and (3.321 is therefore expected. This is an indication that we quantized correctly. 



But notice that this statistical result (that the scalar fields do not obey Pauli exclusion) 
is entirely a result of the commutation relations. Therefore, if we attempt to quantize a 
spin- 1/2 field in the same way, they will obviously not obey Pauli exclusion either. We 
must therefore quantize spin- 1/2 differently. 
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It turns out that the correct way to quantize spin-1/2 fields is to use, instead of commuta- 
tion relations like we used for for scalar fields, anticommutation relations. If the operators 
of our spin-1/2 fields obey 

{a|, a^} = a\a2 = ^ ajal — —(^2^1 
then if we try to act twice with the same operator, we have 

a\a\\0) = -a\a\\0) a{a\\0) = 

In other words, if we quantize with anticommutation relations, it is not possible for two 
particles to occupy the same state simultaneously 

This relationship between the spin of a particle and the statistics it obeys (which demands 
that integer spin particles be quantized by commutation relations and half-integer spin 
particles to be quantized with anticommutation relations) is called the Spin-Statistics 
Theorem. 

And, because particles obeying Pauli exclusion are said to have Bose-Einstein statistics, 
and particles that do not obey Pauli exclusion are said to have Fermi-Dirac statistics, we 
call particles with integer spin Bosons, and particles with half-integer spin Fermions. 



3.2.4 Left-Handed and Right-Handed Fields 



Recall that in the Dirac Lagrangian (3.12), our fundamental field was the 4-component 



spinor tjj 



R 



where tpL transforms under the left-handed (0, 1/2) representation of 



the Lorentz group, and ^pR transforms under the right-handed (1/2, 0) representation. 

In general, we refer to these 2-component spinors as Weyl fields (usually pronounced 
"vile"). So, the fermion is the spinor combination of two Weyl fields, one being the left- 
handed particle, and the other being the right-handed antiparticle. 



Also in (3.12) was the field we defined as ip = ^^7° = If we interpret ip as the 



conjugate of ^ (which the form of the Dirac Lagrangian implies we should), then we see 
that the right-handed field is the conjugate of the left, and vice versa. Or, in other words. 



R 



and 



We take advantage of the fact by writing all fields in terms of left-handed Weyl fields. For 
example, given the two left-handed Weyl fields x ^rid ^, we can form the 4-component 

spinor field ip = , and so tjj = (^, x^)- We will refer to such a field as a Dirac Field, 

and denote itipD- 



87 



On the other hand, we could define a 4-component spinor in terms of a single left-handed 
Weyl field x, or V' = ^^t^ • notice that t/S = (X; X^)/ which is equal simply to the 

transpose of ijj. We refer to such a field (whose conjugate is equal to its transpose) as a 
Majorana Field, and denote it i/jm- 

Recall that an antiparticle has the same mass but opposite charge and opposite handed- 
ness of its particle. So, working with the Dirac field i/jd, we can change the charge by 
merely swapping x and ^, using the Charge Conjugation operator C defined by 



Also, consider the transpose oiipD (which is just returning the conjugate of to column 



form), ijjj) = ( ) • Acting on this with C gives 



O ( X 



So, we have 



CtpD — TpJ) and Ci/ji]^ — ipD 
We therefore say that i/jd and ip]^ are Charge Conjugate to each other. 
However, notice that with the Majorana field. 



and 

So in summary, Dirac fields are not equal to their charge conjugate, while Majorana fields 
are. By analogy with scalars (where the complex conjugate of a real number is equal 
to itself, whereas the complex conjugate of a complex number is not), we often refer to 
Majorana fields as Real, and to Dirac fields as Complex. 

So, we can now write out the Lagrangian for Dirac and Majorana fields in terms of their 
Weyl fields: 

Cn = ^X^^''^,x + ^C^<r''^,^-mix^ + X^^^) (3-33) 
J^M = ^X^^''^^^X - -j^ixX + X^x}) (3.34) 
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3.2.5 Canonical Quantization of Fermions 



We first quantize the Dirac fermion. The general solution to the Dirac equation is 



-ik-xl 



where s =1, 2 are the two spin states, bs and dl are (respectively) the lowering operator for 
the particle and the raising operator for the antiparticle. The charge conjugate oi^'D will 
have the raising operator for the particle and the lowering operator for the antiparticle. 

The Us and Vg are constant 4-component vectors which act as a basis for all parti- 
cle /antiparticle states in the spinor space (for our purposes, they are merely present to 
make tpo 4-component field). 



We quantize, as we said in section 3.2.3 using anf z-commutation relations. Writing only 
the non-zero relation, 

{lpait,x),i)f3{t,x)} = 5^{X - x')(7°)a/3 

These imply that the only non-zero commutation relations in terms of the operators are 

{bsCk),bl,Ck')} = {2T,f5\k - k')2u5ss' 
{d\{k),ds'{k')} = {2Tif5^{k - k')2uj5ss' 



Once again, these form the algebra of a simple harmonic oscillator, and we can therefore 
find the entire spectrum of states by acting on |0) with b\ and d\. 



Then, following a series of calculations nearly identical to the ones in section 3.2.2 
arrive at the Hamiltonian 



we 



H = J2 dk uj[bl{k)bs{k) + dl{k)dsik)] 



X 



(3.35) 



where A is an infinite constant we can merely subtract off and therefore ignore. 



Comparing ( 3.29[ > and (3.351, we see that they both have essentially the same form; u 
(which is energy) to the left of the creation operator, which is to the left of the annihilation 
operator. To understand the meaning of this, we will see how it generates energy eigen- 
values. We will use equation ( 3.29[ > for simplicity. Consider acting with the Hamiltonian 
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operator on some arbitrary state \p) with momentum p. Using ( 3.30[ >, 

H\p) = I dkuka\k)a{k)\p) = J dkukO^ {k)a{k) \j2ujpa) (p)|0) 

5fco;fcy2lJj;a^(fc) ((27r)^2cjp5^(A; - p) + a"^(p)a(A;)) |0) 

dkuJk^2^pa\k){2-Kf2uOp5^{k-p)\Q) 
d^k 

ujk^/2^a\k){27rf2ujp6\k - p)\0) 



(27r)32cufc 

d^k^/2[J~pa\k)ujp5^{k-p)\0) 
= Up^/2u^pa^0) = Ljp\p) 

So, H\p) = ujp\p), where tOp = p"^ + m?, which is the relativistic equation for energy as in 
equation ( 3.25[ >. So, the Hamiltonian operator gives the appropriate energy eigenvalue on 



our physical quantum states. 

For the Dirac Hamiltonian the eigenvalue will be a linear combination of the energies of 
each type of particle. If we denote the states as Sb] pd, Sd), where the first two elements 
give the state of a 6 type particle and the second of the d type particle, we have 

H\pb, Sb;pd, Sd) = ■■■ = {ujp^ +ujpj\pb, Sb;pd, Sd) 
For Majorana fields things are simpler. We only have one type of particle, so 

i^Mix) = Y,j dk[hs{k)us{k)e'''-'' + h\{k)vs{k)e-'^-^] 



s=l 



And quantization with anticommutation relations will give 

H = J2 dkco h\{k)h,{k) 

s=l 

3.2.6 Insufficiencies of Canonical Quantization 

While the Canonical Quantization procedure we have carried out in the past several sec- 
tions has given us a tremendous amount of information (the entire spectrum of states 
for bosons, Dirac fermions, and Majorana fermions), it is still lacking quite a bit. As we 



said at the beginning of section 3.1.1 we ultimately want a relativistic quantum mechan- 
ical theory of interactions. Canonical Quantization has provided a relativistic quantum 
mechanical theory, but we aren't close to being able to incorporate interactions into our 
theory. While it is possible to incorporate interactions, it is very difficult, and in order to 
simplify we will need a new way of quantizing. 
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3.2.7 Path Integrals and Path Integral Quantization 



Perhaps the most fundamental experiment in quantum mechanics is the Double Slit ex- 
periment. In brief, what this experiment tells us is that, when a single electron moves 
through a screen with two slits, and no observation is made regarding which slit it goes 
through, it actually goes through both slits, and until a measurement is made (for exam- 
ple, when it hits the observation screen behind the double slit), it exists in a superposition 
of both paths. As a result, the particle exhibits a wave nature, and the pattern that emerges 
on the observation screen is an interference pattern — the same as if a classical wave was 
passing through the double slit-all paths in the superposition of the single electron are 
interfering with each other, both destructively and constructively. Once the electron is 
observed on the observation screen, it collapses probabilistically into one of its possible 
states (a particular location on the observation screen). 

If, on the other hand, you set up some mechanism to observe which of the two slits the elec- 
tron travels through, then the observation has been made before the observation screen, 
and you no longer have the superposition, and therefore you no longer see any indica- 
tion of an interference pattern. The electrons are behaving, in a sense, classically from the 
double slit to the observation screen in this case. 

The meaning of this is that a particle that has not been observed will actually take every 
possible path at once. Once an observation has been made, there is some probability 
associated with each path. Some paths are very likely, and others are less likely (some 
are nearly impossible). But until observation, it actually exists in a superposition of all 
possible states/paths. 

So, to quantize, we will create a mathematical expression for a "sum over all possible 
paths". This expression is called a Path Integral, and will prove to be a much more useful 
way to quantize a physical system. 

We begin this construction by considering merely the amplitude for a particle at position 
qi at time ti to propagate to q2 at time t2. This amplitude will be given by 

(g2,i2|gi,ii) = (g2|e'^(*-*^)|gi) 

To evaluate this, we begin by dividing the time interval T = t2 — ti into N+l equal inter- 
vals of length 5t — each. So, we can insert N complete sets of position eigenstates, 

(g2,t2|gi,ti) = / lldQ,{q2\e-'''''\QN){QN\e-'''''\QN-i)---{Qi\e-'''''\qi) (3.36) 
i=i 

Let's look at a single one of these amplitudes. We know that in nearly all physical theories, 
we can break the Hamiltonian up as if = ^ + ^{Q)- So, using the completeness of 
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momentum eigenstates. 



p,2 p^P'Q^ + l p-^P'Q^ 



. 27r V27r 



271 

dP' i[p'{Q,+i-Q,)-HSt] 

27r 



And taking the limit as 6t 0, 9i+i_Q± ^ g.. So, 



5t 

271 / 27r 



where the subscript on dt merely indicates where the infinitesimal time interval "ends". 
So, we can plug this into ( 3.36[ > and taking the limit as 6t 0, 



{q2,t2\quh) = / ]JdQMe-'''''\QN){QN\e-'"''\QN^i)---{Qi\e-'''%i) 

= lim r T\dQ [ '^e^'^HP'NQN-H]^tdtN[P'^^^QN^r-H] . . . ^tdti[P[Q^-H] 

/OO J 
DpPg ^-i St', dt{pq-H) 
-OO 

where I'p = H^^i dpi and I^g = H^i dqi- 

2 

And if p shows up quadratically (as it always does; then we can merely do the Gaus- 
sian integral over p, resulting in an overall constant which we merely absorb back into 
the measure when we normalize. Then, recognizing that the integrand in the exponent is 
pq — H = C, we have 

{q2,t2\qut,) = Jvq e^/'i = J Vqe'' (3.37) 



Formally, the measure of (3.37) has an infinite number of differentials, and therefore eval- 



uating it would require doing an infinite number of integrals. This is to be expected, since 
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the point of the path integral is a sum over every possible path, of which there are an 
infinite number. So, because we obviously can't do an infinite number of integrals, we 
will have to find a clever way of evaluating (3.37 1. But before doing so, we discuss what 
the path integral means. 



3.2.8 Interpretation of the Path Integral 



Equation (3.37) says that, given an initial and final configuration (gi, ti) and (g2, h)> abso- 
lutely any path between them is possible. This is the content of the Vq part: it is the sum 
over all paths. 

Then, for each of those paths, the integral assigns a statistical weight of e*"^ to it, where the 



action S is calculated using that path (recall our comments in section 1.1.1 about S being 
a functional, not a function). 

So, consider an arbitrary path qo, which receives statistical weight e^'^^'^''^ . Now, consider 
a path q' very close to qo, only varying by a small amount: q' = qo + e6qo. This will have 

statistical weight e^'^[''o+'^'?ol = e'^^'^°^^''^'^°'~^ , where f- is the Euler Lagrange derivative 



6S _ ddS _dS 
6q dt dq dq 



To make our intended result more obvious, we do a Wick rotation, taking t it, so 

dt idt, and S = J dtC —^ij dtC = iS, and e*'^ e'^. Now, the path q' = qo + e5qo gets 

weight e^'^'^o'e" 



So, if 1^ is very large, then the weight becomes exponentially small. In other words, the 
larger the variation of the action is, the less probable that path is. 

So the most probable path is the one f or the smallest value of or the path at which 



4^ = 0. And as we discussed in 

Sq 



St ■ 

this is the path of Least Action. Thus, we have 



1.1.1 

recovered classical mechanics as tne tirst order approximation of quantum mechanics 



So, the meaning of the path integral is that all imaginable paths are possible for the par- 
ticle to travel in moving from one configuration to another. However, not all paths are 
equally probable. The likelihood of a given path is given by the action exponentiated, and 
therefore the most probable paths are the ones which minimize the action. This is the rea- 
son that, macroscopically, the world appears classical. The likelihood of every particle in, 
say, a baseball, simultaneously taking a path noticeably far from the path of least action 
is negligibly small. 

We will find that path integral quantization provides an extremely powerful tool with 
which to create our relativistic quantum theory of interactions. 
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3.2.9 Expectation Values 



Now that we have away of finding (g2, hlqi, ti), the natural question to ask next is how do 
we find expectation values like (52, t2\Q{t')\qi, ti) or {q2, t2\P{t')\qi, h). By doing a similar 
derivation as in the last section, it is easy to show that 



{q2,t2\Q{t')\qi,h) 



I 



Vq Q{t')e 



We will find that evaluating integrals of this form is simplified greatly through making 
use of Functional Derivatives. For some function f{x), the functional derivative is de- 
fined by 



f{x) = 5{x - y) 



Next, we modify our path integral by adding an Auxiliary External Source function, so 
that 

C^C + f{t)Q{t) + h{t)P{t) 

So we now have 

which allows us to write out expectation values in the simple form 



{q2M\Q{t')\qiM) 



15 



f,h=0 



f,h=0 



VqQ{tY 



or 



{q2M\Pit')\qxM) 



1 5 



^{(l2,t2\qi,h) f,h 



i 5h{t') 

VqP{t')e'^ 



f,h=o 



VqP{t')e'^+'^'^^'^-^^^''^^ 



f,h=o 



So, once we have (g2, ^2^1,^1)/ we can find any expectation value we want simply by 
taking successive functional derivatives. 



94 



3.2.10 Path Integrals with Fields 



Because we can build whatever state we want by acting on the vacuum, the important 
quantity for us to work with will be the Vacuum to Vacuum expectation value, or YEV, 
(0|0), and the various expectation values we can build through functional derivatives 
((O|00|O), (0|V#|0),etc.). 



For simplicity let's consider a scalar boson 0. The Lagrangian is given in equation (3.101 
Using this, we can write the path integral 



(0|0) 



We will eventually want to find expectation values, so we introduce the auxiliary field J, 
creating 



So, for example, (0|<^|0) 



1 A 

i 5 J 



(0|0). 



V(j)e 



if d*x{Co+J<t>) 



(3.38) 



I J=0" 



Of course, we still have a path integral with an infinite number of integrals to evaluate. 
But, we are finally able to discuss how we can do the evaluation. 

We define Zq{J) = (0|0) j. Then, making use of the Fourier Transform of 0, 

Jkx ' 



we begin with the Cq part: 

So = d^xCo = 



(27l) 



- -9^ 
2 



— m 
2 



d*k 

d^k 



d^k' 
d^k' 



(27r)^ 



Ak -x 



<P{k'] 



ik'-x 



(27r) 



d X 



d^kd^k' 



2 J (27r)8 
1 f d^kd^k' 

2] "(277) 

1 r d^k 

2] (2^0^ 



^ik-x ^ik' -x 



(j){k)(j){k'){k^'k' -m'^ 



(27r)8 
d^kd^k' 



-4>{k)4>{k'){k% - m^){27ryS\k + k') 
0(A;)(fc2 + m^)0(-A;) 
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Then, transforming the auxiliary field part. 



d'^xJ{x)(f){x) 



d X 



(27r)8 
d'^kd'^k' 



d^k 



d^k' 



(27r) 
d^k 
(2^ 



J{k)(t){k') j 

J{k)4>{k'){2'KY6\k + k') 



J{k)(f){-k) 



And because the integral is over all k'^, we can rewrite this as 

d^k 



{J{k)(j){-k) + J{-k)(j){k)) 



(we did this to get the factor of 1/2 out front in order to have the same coefficient as the 
Co part from above). 



So, 



S 



1 r d'^k 



<p{k){k^ + m'^)(P{-k) + J{k)<p{-k) + J{-k)(P{k) 



Now, we make a change of variables. 



x{k) ^ m - 



j{k) 



^2 _|_ 

(Note that this leaves the measure of the path integral unchanged: T)<p T^X-) 
Plugging this in, we have, 

1 r d^k 



s 



(2vr) 



m + Tp^]ik' + m^)(xi~k)+ 



^2 _|_ ^2 



k"^ + m 



+ jmx{-k) + ^^^] + j{-k)[m + 



2 

J\k) 



1 r d^k 

2 J (2^ 



J^2 _|_ j^2 

X{k){k^ + m'M-k) + 



J^2 _|_ ^2 

J{k)J{-k) 
^2 _|_ ^2 



(The point of all of this is that, in this form, we have all of the </>, or equivalently X/ depen- 
dence in the first term, with no </> or x dependence on the second term.) 



Finally, our path integral ( 3.38[ > is 



(0|0). 



Vxe 
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Now, using some clever physical reasoning, we can see how to evaluate the infinite num- 
ber of integrals in this expression. Notice that if we set J = 0, we have a free theory in 
which no interactions take place. This means that if we start with nothing (the vacuum), 
the probability of having nothing later is 100%. Or, 

(0|0).|,^„ = 1 = 1 Vxe'^^^^'^^'^^''^-'^^^-'^^ 
And if that part is 1, then we have 

f 1 r d-^k J{k)J{-k) 

(0|0)j = / Pxe' 



And remarkably, the integrand has no x dependence\ Therefore, the infinite number of 
integrals over all possible paths becomes nothing more than a constant we can absorb 
into the normalization, leaving 



i r d'^k J(k).](-k) 



(27r)^ k'^+m'^ 



m)j = e^^' 

We can Fourier Transform back to coordinate space to get 

Zo(J) = (0|0)j = e5/'^'-'^'-'^(-)^(— (3.39) 

where 

^^""^^ = 7 (27r)4p + m2 
is called the Feynman Propagator for the scalar field. 

We can then find expectation values by operating on this with } ^ as described in section 
13:2:91 

We can repeat everything we have just done for fermions, and while it is a great deal more 
complicated (and tedious), it is in essence the same calculation. We begin by adding the 
auxiliary function f]ip + ^pr|, to get expectation values of and ip by using 4 ^ and 4 Ji, 
respectively. 

We then Fourier Transform every term in the exponent and find that we can separate out 
the and 4' dependence, allowing us to set the term which does depend on ^ and tjj equal 
to 1. Fourier Transforming back then gives 

Zo{v,v) = e^/'^'^°'*^'^(^)^(^-^')''(^') (3.40) 

where 



S{x — x) 



(27r)4 fc2 + ^2 
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is the Feynman propagator for fermion fields. 



Recall that we are calling the auxiliary fields J, rj, and f] Source Fields. Comparing the 
form of the Lagrangian in equation (3.38 ) to ( 1.13[ > reveals why. J, rj, and fj behave mathe- 
matically as sources, giving rise to the field they are coupled to, in the same way that the 
electromagnetic source gives rise to the electromagnetic field A^. The meaning behind 
equations (3.39) (and (3.40 1) is that J (or r] and ff) act as sources for the fields, creating a 
(or if) and ip) at spacetime point x, and absorbing it at point x' . The terms A(x — x') and 
S{x — x') then represent the expression giving the probability amplitude (0|0) for that par- 
ticular event to occur. In other words, the propagator is the statistical weight of a particle 
going from x to x'. 



3.2.11 Interacting Scalar Fields and Feynman Diagrams 

We can now consider how to incorporate interactions into our formalism, allowing us to 
finally have our relativistic quantum theory of interactions. 



Beginning with the free scalar Lagrangian (3.10), we can add an interaction term Ci. At 



this point, we only have one type of particle, (p, so we can only have 0's interacting with 
other (p's. Terms proportional to (p or 0^ are either constant or linear in the equations 
of motion, and therefore aren't valid candidates for interaction terms. So, the simplest 
expression we can have is 

where || is a conventional normalization, and g is a Coupling Constant. So our total 
Lagrangian is 

C = Co + Ci = -^d^<pd^<P - ^mV' + 19<P' 
and the path integral is 

zij) = mj 

gj / d'^xCi^i J d'''x[Co+J(t>\ 



But, recall that we can bring out a factor of from (0|0) j using the functional derivative 

_s_ 

5J- 



^ ^f. So, we can make the replacement 
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And notice that once this is done, there is no longer any </> dependence in Z{J). So, with 



the free theory, we were able to remove the dependence, leading to (3.39). And here. 



we were able to remove it from the interaction term as well. So, once again, the infinite 
number of integrals in ( 3.37[ > will merely give a constant which we can absorb into the 
normalization. 



This leaves the result 



Z{J) = e^^^'''"(^4^))'zo(J) 



Now, we can do two separate Taylor expansions to these two exponentials. 



SJ{x) 



X 



oo ^ 

pi 

p=o 



d^yd^zJ{y)A{y-z)J{z) 



(3.41) 



Now, recall that a functional derivative ^jj, will remove a J term. Furthermore, after 
taking the functional derivatives, we will set J = to get the physical result. So, for a term 
to survive, the 2P sources must all be exactly removed by the 3V functional derivatives. 



So, using (3.41), we can expand in orders of g (the coupling constant), keeping only the 
terms which survive, and after removing the sources, evaluate the integrals over the prop- 
agators A. The value of the integral will then be the physical amplitude for a particular 
event. 

In practice, a slightly different formalism is used to organize and keep track of each term 
in this expansion. Note that there will be P propagators A. We can represent each of these 
terms diagrammatically, by making each source a solid dot, each propagator a line, and 
let the g terms be vertices joining the lines together. There will be a total of V vertices, 
each joining 3 lines (matching the fact that we are looking at (f)'^ theory; there would be 4 
lines at each vertex for (f)'^ theory, etc.). 



For example, for V = and P = 1, 



Z{J) 



d^yd'zJ{y)A{y-z)J{z) 



We have two sources, one located at z and the other located at y, so we draw two dots, 
corresponding to those locations. Then, the propagator A{y — z) connects them together, 
so we draw a line between the two dots. The diagram should look like this: 
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Of course, once we set J = 0, this will vanish because it contains two sources. 



As another example, consider V — and P — 2. Now, 

Z{J) = ^(0'/ d'yd'zdyd'z'{j{y)A{y - z)J{z)){j{y')A{y' - zViz')) 

This corresponds to four sources, located at y,z,y' and z', with propagator lines connect- 
ing y to z, and connecting y' to z'. But, there are no lines connecting an unprimed source 
to a primed source, so this results in two disconnected diagrams: 



As another example, consider V = 1 and P = 2, 



Z{J) = -| / rf^x 



S 



1 / • \ 2 
1 / t 



6 J \SJ{x)^ 

d'yd'zdyd'z'{J{y)A{y - z)J{z)) {J{y')A{y' - z')J{z')) 

j d^xd^yd^zdyd^z'5{y - x)A{y - z)5{z - x)5{y' - x)A{y' - z')J{z') 
d^xd^z'A{x - x)A{x - z')J{z') 
This will correspond to 



2! V 2 



9_ 
48 




where the source J is located at the dot, and the vertex joining the line to the loop is at x. 

You can work out the following out, and see that there are multiple possible diagrams for 

y = 3, P = 5 
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•oo 



And f or V" = 2, P = 4, 




And forV =1,P = 3, 



and so on. 



Through a series of combinatoric and physical arguments, it can be shown that only con- 
nected diagrams will contribute, and the and ^ terms will always cancel exactly. 

So, to calculate the amplitude for a particular interaction to happen (say 0's in and M 
(f)'s out), draw every connected diagram that is topologically distinct, and has the correct 
number of in and out particles. Then, through a set of rules which you will learn formally 
in a QFT course, you can reconstruct the integrals which we started with in ( 3.41[ >. 



When you take a course on QFT, you will spend a tremendous amount of time learning 
how to evaluate these integrals for low order (they cannot be evaluated past about second 
order in most cases). While this is extremely important, it is not vital for the agenda of 
these notes, and we therefore do not discuss how they are evaluated. 

The idea is that each diagram represents one of the possible paths the particle can take, 
along with the possible interactions it can be a part of. Because this is a quantum mechan- 
ical theory, we know it is actually in a superposition of all possible paths and interactions. 
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We don't make a measurement or observation until the particles leave the area in which 
they collide, so we have no idea about what is going on inside the accelerator. We know 
that if this goes in and this comes out, we can draw a particular set of diagrams which 
have the correct input and output, and the nature of the interaction terms (which deter- 
mines what types of vertices you can have) tells us what types of interactions we can 
have inside the accelerator. Evaluating the integrals then tells us how much that partic- 
ular event/ diagram contributes towards the total probability amplitude. So, if you want 
to know how likely a certain incoming/ outgoing set of particles is, write down all the 
diagrams, evaluate the corresponding integrals, and add them up. 

And as we pointed out above, the classical behavior (which is more probable) is closer 
to the first order approximation of the quantum behavior. Therefore, even though in 
general we can't evaluate the integrals past about second order, the first few orders tell us 
to a reasonable (in fact, exceptional in most cases) degree of accuracy what the amplitude 
is. If we want more accuracy, we can seek to evaluate higher orders, but usually lower 
orders suffice for experiments at energy levels we can currently attain. 

One of the difficulties encountered with evaluating these integrals is that you almost al- 
ways find that they yield infinite amplitudes. Since an amplitude (which is a probability) 
should be between and 1, this is obviously unacceptable. The process of finding the 
infinite parts and separating them from the finite parts of the amplitude is a very well 
defined mathematical construct called Renormalization. The basic idea is that any infi- 
nite term consists of a pure infinity and a finite part. For example (trust us for now) the 
infinite sum: 



There is a part which is a pure infinity (the first term on the right hand side), and a term 
which is finite. While this may seem strange and extremely unfamiliar (and a bit like 
hand waving), it is actually a very rigorous and very well understood mathematical idea. 

Much of what particle physicists attempt to do is find theories (and types of theories) that 
can be renormalized and theories that cannot. For example, the action which leads to 
General Relativity leads to a quantum theory which cannot be renormalized. Renormal- 
ization is a fascinating and deep topic, and will be covered in great depth in any standard 
QFT text or course. Unfortunately, we will not discuss it further. 

3.2.12 Interacting Fermion Fields 

The analysis we performed above for scalar fields above is almost identical for fermions, 
and we therefore won't repeat it. The main difference is that the interaction terms will 
have a field ijj interacting with V'/ and so the vertices will be slightly different. We won't 
bother with those details. 




n=l 
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Finally, we can have a Lagrangian with both scalars and fermions. Then, naturally, you 
could have interaction terms where the scalars interact with fermions. While there are 
countless interaction terms of this type, the one that will be the most interesting to us is 
the Yukawa term, 

jCvuk = g4>i''ip (3.42) 



If we represent by a dotted line, by a line with an arrow in the forward time direction, 
and ijj with an arrow going backwards in time, this interaction term will show up in a 
Feynman diagram as 




Once each diagram is drawn, there are well defined rules to write down an integral cor- 
responding to each diagram. 



3.3 Final Ingredients 



The purpose of the previous section was merely to introduce the idea of Feynman Dia- 
grams as a tool to calculate amplitudes for physical processes. In doing so, we have met 



the goal set out in section 3.1.1 a relativistic quantum mechanical theory of interactions. 



We achieve such a theory by finding a Lagrangian of a classical theory (both with and 



without interaction terms), and using equation (3.41) (and the analogous equation for 
fermions) to write down integrals which, when evaluated, give a contribution to a total 
amplitude. It is important to remember that we will eventually set all sources J to zero, 
and a functional derivative (as contained in the interaction term Ci) will set any term 
without J's to zero. So, the only non-zero terms will be the ones where all of the J's are 
exactly removed by the functional derivatives. 

A large portion of understanding QFT is learning how to set these integrals up in greater 
detail, and learning several methods to evaluate them. We will not delve into those details 
of Perturbative Quantum Field Theory, where amplitudes are studied order by order, 
here. The goal of these notes is merely to explain how, once given a Lagrangian, that 
Lagrangian can be turned into a physically measurable quantity. 
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With this done, we now set out to find the Lagrangian for the Standard Model of Particle 
Physics, the theory which seems to explain our universe (apart from gravity). Once this 
Lagrangian has been explained, we trust you have a general concept of what to do with 
it from the previous sections. 

However, before we are able to explain the Standard Model Lagrangian, there are a few 
final concepts we need. They will be the subject of this section. Namely, we will be study- 



ing the ideas of Spontaneous Symmetry Breaking and Gauge Theories. In section 3.1.11 
we discussed the simple U{1) gauge theory, where we made a global U{1) symmetry of 
the free Dirac Lagrangian a local U{1) symmetry, or a gauged symmetry, and showed that 
consistency demanded the introduction of a gauge field A'^\ and consequently a kinetic 
term and a source term. Thus we recovered the entire electromagnetic force from nothing 
but f/(l). Later in this section, we generalize this to arbitrary Lie group. Because U{1) 



is an Abelian group, we refer to the gauge theory of section [3.1.11 as an abelian gauge 



theory. For a more general, non-Abelian group, we refer to the theory resulting as a 
Non- Abelian Gauge Theory. Such theories introduce a great deal of complexity, and we 
therefore consider them in detail in this section before moving on to the Standard Model. 

However, we begin with the idea of spontaneous symmetry breaking. 



3.3.1 Spontaneous Symmetry Breaking 



Consider a complex scalar boson (p and 0^. The Lagrangian will be 



Naturally we can write this as 



where 



£ = _l5M0t5^0_V(0t^0) (3.43) 



This Lagrangian has the U (1) symmetry we discussed in 3.1.11 
Also, notice that we can graph 0), plotting V vs. |0|, 
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We see a "bowl" with Vminimum at |0p = 0. The vacuum of any theory ends up being at the 
lowest potential point, and therefore the vacuum of this theory is at = 0, as we would 
expect. 

Now, let's change the potential. Consider 

where A and $ are real constants. Notice that the Lagrangian will still have the global 
[/(I) symmetry from before. But, now if we graph V \s. we get 




where now the vacuum Vminimum is represented by the circle at |(/)| = $. In other words, 
there are an infinite number of vacuums in this theory. And because the circle drawn 
in the figure above represents a rotation through field space, this degenerate vacuum is 
parameterized by e*", the global U{1). There will be a vacuum for every value of a, located 

at 101 = 

In order to make sense of this theory, we must choose a vacuum by hand. Because the 
theory is completely invariant under the choice of the U (1) e'", we can choose any a and 
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define that as our true vacuum. So, we choose a to make our vacuum at </> = $, or where 
(p is real and equal to $. We have thus, in a sense. Gauged Fixed the symmetry in the 
Lagrangian, and the U{\) symmetry is no longer manifest. 

Now we need to rewrite this theory in terms of our new vacuum. We therefore expand 
around the constant vacuum value $ to have the new field 

= $ + a + i/3 

where a and (3 are new real scalar fields (so = $ + a — i(3). We can now write out the 
Lagrangian as 



Am^ 

2 



(3.45) 

This is now a theory of a massive real scalar field a (with mass = V4Am^$^), a massless real 
scalar field /3, and five different types of interactions (one allowing three a's to interact, 
the second allowing one a and two P's, the third allowing four a's, the fourth allowing 
two a's and two P's, and the last allowing four /5's.) In other words, there are five different 
types of vertices allowed in the Feynman diagrams for this theory. 

Furthermore, notice that this theory has no obvious U{1) symmetry. For this reason, writ- 
ing the field in terms of fluctuations around the vacuum we choose is called "breaking" 
the symmetry. The symmetry is still there, but it can't be seen in this form. 

Finally, notice that breaking the symmetry has resulted in the addition of the massless 
field p. It turns out that breaking global symmetries as we have done always results in a 
massless boson. Such particles are called Goldstone Bosons. 



3.3.2 Breaking Local Symmetries 

In the previous section, we broke a global U{1) symmetry. In this section, we will break 
a local U{1) and see what happens. We begin with the Lagrangian for a complex scalar 
field with a gauged U{1): 

where we have taken the external source = 0. Let's once again assume V[<\)^ ■, (j)) has the 



form of equation (3.441, so the vacuum has the f/(l) degeneracy at |0| = $. 



Because our U{1) is now local, we choose a{x) so that not only is the vacuum real, but 
also so that is always real. We therefore expand 

= $ + /i (3.46) 
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where his a real scalar field representing fluctuations around the vacuum we chose. 
Now, 

£ = -l[{d^-tqA^){<!> + h)][{d, + iqA,){<!> + h)]-^F,,F^'' 
Axm^[{^ + h){^ + h) - 

- -Id^hd.h - UxmW - If-F,. - \q^^^A^ + 

where the allowed interaction terms include a vertex connecting an h and two A^'-'s, four 
h's, and three h's. 

So, before breaking, we had a complex scalar field and a massless vector field A^ with 
two polarization states (because it is a photon). Now, we have a single real scalar h with 
mass = v^4Am2$2 ^j-^^j ^ field A'^ with mass = g$. In other words, our force-carrying 
particle A'^ has gained mass! We started with a theory with no mass, and by merely 
breaking the symmetry, we have introduced mass into our theory. 

This mechanism for introducing mass into a theory, called the Higgs Mechanism, was 
first discovered by Peter Higgs, and the resulting field h is called the Higgs Boson. 

So, whereas the consequence of global symmetry breaking is a massless boson called a 
Goldstone boson, the consequence of a local symmetry breaking is that the gauge field, 
which came about as a result of the symmetry being local, acquires mass. 



3.3.3 Non-Abelian Gauge Theory 



We are now ready to generalize what we did in section 3.1.11 to an arbitrary Lie group. 



Consider a Lagrangian C with scalar (or spinor) fields (pi {i = 1, . . . , N) that is invariant 
under a continuous SO{N) or SU (N) symmetry, (pi Uij(pj, where Uij is an x matrix 
of SO{N) or SU{N). 



In section 3.1.11 we saw that if the group is f/(l), gauging it demands the introduction of 
the gauge field A'-' to preserve the symmetry, which shows up in the covariant derivative 
= dfj, — ieA^. To say a field carried some sort of charge means that it has the corre- 
sponding term in its covariant derivative. We then added a kinetic term for A^ as well as 
an external source J^. Then, higher order interaction terms can be included in whatever 
way is appropriate for the theory. 

To generalize this, let's say for the sake of concreteness that our Lie group is SU (N). An 
arbitrary element of SU{N) is e^^^"^^^^", where g is a constant we have added for later 



convenience, O"" are the — 1 parameters of the group (cf. section 2.2.15 1, and the are 
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the generator matrices for the group. Notice that we have gauged the symmetry (in that 
6{x) is a function of spacetime). 

By definition, we know that the generators will obey the commutation relations 



(cf equation (2.9 1), where fabc are the structure constants of the group. 



When gauging the U (1) in section 3.1.11 the transformation of the gauge field was given 
by equation (3.16). For the more general transformation (pi — > Uijcpj, the gauge field trans- 
forms according to 

^ U{x)A''U\x) + -U{x)d^'U\x) 
9 

(where we have removed the indicial notation and it is understood that matrix multipli- 
cation is being discussed). If U{x) is an element of U (1) (so it is e*^^*^^^), then this transfor- 
mation reduces to 



which is exactly what we had in (1.15 1. For general SU{N), however, the U's are elements 
of a Non-Abelian group, and the A'^'s are matrices of the same size. 

Generalizing, we find that a general element of the SU {N) is (changing notation slightly) 

with A^^ — 1 real parameters V^. We then build the covariant derivative in the exact same 
way as in equation ( 3.17[ > by adding a term proportional to the gauge field 

D, = I^x^S^ - ^gA, 

(Remember that each component of is an x matrix. They were scalars for U{1) 
because [/(I) is a 1 x 1 matrix.) Or, acting on the fields, the covariant derivative is 



{D^,(t>)j = d^(t)j{x) - ig[A^{x)]jk(t)k{x) 



(3.47) 



where k is understood to be summed on the last term. It will be understood from now 
on that the normal partial derivative term (the first term) has an x identity matrix 
multiplied by it. 



Then, just as in (3.191, we have the field strength 



F^^(x) = -[D^, D^] = d^A^ - d^A^ - ig[A^, A^] 



(3.48) 



where the commutator term doesn't vanish for arbitrary Lie group as it did for Abelian 

f/(l). 
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Recall from equation (1.16) that for U{1), F^,y is invariant under the gauge transforma- 



tion (1.151 on its own, because the commutator term vanishes. In general, however, the 



commutator term does not vanish, and we must therefore be careful in writing down the 
correct kinetic term. It turns out that the correct choice is 



Kin 



(3.49) 



It may not be obvious, but this form is actually a consequence of (2.28 1. There is algebraic 
machinery working under the surface of this that, while extremely interesting, is unfortu- 
nately beyond the scope of what we are doing. We will discuss all of these ideas in much 
greater depth later in this series. 

So, starting with a non-interacting Lagrangian that is invariant under the global SU (N), 
we can gauge the SU (N) to create a theory with a gauge field (or synonymously a "force 
carrying" field) A^^, which is an x matrix. So, every Lie group gives rise to a particular 
gauge field (which is a force carrying particle, like the photon), and therefore a particular 
force. 



For this reason, we discuss forces in terms of Lie groups, or synonymously Gauge 



Groups. Each group defines a force. As we said at the very end of section 2.2.11 U{1) rep- 



resents the electromagnetic force (as we have seen in section 3.1.11 while SU (2) describes 
the weak force, and SU (3) describes the strong color force. 



3.3.4 Representations of Gauge Groups 



As we discussed in section 2.2 given a set of structure constants fabc, which define the 
Lie algebra of some Lie group, we can form a representation of that group, which we 
denote R. So, R will be a set of D(R) x D(R) matrices, where D is the dimension of the 
representation R. We then call the generators of the group (in the representation R) Tg, 
and they naturally obey [T^, T^] = tfabcT^. 

One representation which exists for any of the groups we have considered is the represen- 
tation of SO{N) or SU{N) consisting of x matrices. We denote this the Fundamental 
Representation (also called Defining Representation in some books). Clearly, the funda- 
mental representations of S0{2), 50(3), SU{2), and SU{3) are the 2 x 2, 3 x 3, 2 x 2, and 
3x3 matrix representations, respectively. We will denote the fundamental representation 
for a given group by writing the number in bold. So, the fundamental representation of 
SU{2) will be denoted 2, and the generators for SU{2) in the fundamental representation 
will be denoted T2 . Obviously, the fundamental representation of SU (3) will be 3 with 
generators T3 . 

Furthermore, let's say we have some arbitrary representation generated by T^, obeying 
[T^, T^] = ifabcT^- We can take the complex conjugate of the commutation relations to get 
[T^'^, T^^] = -ifabcT^. So, notice that if we define the new set of generators T'^ = -T^", 
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then the T'^ will obey the correct commutation relations, and will therefore form a rep- 
resentation of the group as well. If it turns out that T'^ = — (T^)* = T^, or if there is 
some unitary similarity transformation U"^T^U such that T'j^ = -{T^y = Tg, then 

we call the representation Real, and the complex conjugate of the T^'s is the same repre- 
sentation. However, if no such transformation exists, then we have a new representation, 
called the Complex Conjugate representation to R, or the Anti-i? representation, which 
we denote R. 

For example, there is the fundamental representation of SU (3), denoted 3, generated by 
T3 , and then there is the anti-fundamental representation 3, generated by Tg . 

The representations of a group which will be important to us are the fundamental, anti- 
fundamental, and adjoint. 



3.3.5 Symmetry Breaking Revisited 



As we said in section 3.3.3 given a field transforming in a particular representation R, the 



gauge fields will be D{R) x D{R) matrices. 

Once we know what representation we are working in, and therefore know the generators 
Tg, it turns out that it is always possible to write the gauge fields in terms of the genera- 



tors. Recall in sections [2.2.2 and 2.2.12 we encouraged you to think of the generators as 
basis vectors which span the parameter space for the group. Because the gauge fields live 
in the N x N space as well, we can write them in terms of the generators. That is, instead 
of the gauge fields being N x N matrices on their own, we will use the N x N matrix 
generators as basis vectors, and then the gauge fields can be written as scalar coefficients 
of each generator: 

A" = A^T^ (3.50) 

where a is understood to be summed, and each A'^ is now a scalar function rather than a 
D(R) X D(R) matrix (the advantage of this is that we can continue to think of the gauge 
fields as scalars with an extra index, rather than as matrices). As a note, we haven't 
done anything particularly profound here. We are merely writing each component of the 
D{R) X D{R) matrix A^ in terms of the D{R) x D{R) generators, allowing us to work 
with a scalar field rather than the matrix field A^. We now actually view each A'^ as a 
separate field. So, if a group has N generators, we say there are N gauge fields associated 
with it, each one having 4 spacetime components pL. 

In matrix components, we will have 

= {A^J^nh 



Then, the covariant derivative in ( 3.47| > will be 



= d^<Pj{x) - ig[A^a{x)T%]jk<Pk{x) (3.51) 
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We may assume that the field strength F'^'^ can also be expressed in terms of the genera- 
tors, so that we have 

F^" = F^^T" (3.52) 

or 

{Fnij = {FrTnij (3.53) 



Now, using (2.28 1 (and taking k = 1/2 by convention), we can write (3.49 ) in terms of the 
new basis: 



1 



Kin 



TT{F,,Fn 



ab 



— F^F^-^K 



(3.54) 



(we have raised the index a on the second field strength term in the last two lines simply 
to explicitly imply the summation over it. The fact that it is raised doesn't change its value 
in this case; it is merely notational). 

Furthermore, we can use ( 2.28| > to invert (3.52): 

Tr (F^'^T^) = Tr (T'^T^) 
Tt{F^"'T^) 



Fr^S' 



ab 



Tr (F^^T^^ = -F'^" 
y ' 2 ^ 

= 2Tr(F'^^T'^) 



(3.55) 



In sections 3.3.1 and 3.3.2 we broke the U{1) symmetry, which only had one generator. 
However, if we break larger groups we may only break part of it. For example, we will 
see that SU (3) has an SU (2) subgroup. It is actually possible to break only the SU (2) part 
of the SU{?>). So, three of the SU{'i) generators are broken (the three corresponding to 
the SU{2) subgroup /subalgebra), and the other five are unbroken. Because we are now 
writing our gauge fields using the generators as a basis, this means that three of the gauge 
fields are broken, while five of the gauge fields are not. 



Finally, recall from section 3.3.2 that breaking a local symmetry results in a gauge field 



gaining mass. We seek now to elucidate the relationship between breaking a symmetry 
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and a field gaining mass. First, we can summarize as follows: Gauge fields corresponding 
to broken generators get mass, while those corresponding to unbroken generators do not. The 
unbroken generators form a new gauge group that is smaller than the original group that was 
broken. 



In 3.3.2 we saw that breaking a symmetry gave the gauge field mass. Now, we see that 



giving a gauge field mass will break the symmetry. 

To make this clearer, we begin with a very simple example, then move on to a more 
complicated example. 



3.3.6 Simple Examples of Symmetry Breaking 

Consider a theory with three real massless scalar fields 0j (i = 1,2,3) and with Lagrangian 
which is clearly invariant under the global 5*0(3) rotation 

where R^^ is an element of SO (3), because the Lagrangian is merely a dot product in field 
space, and we know that dot products are invariant under 5*0(3). 

Now, let's say that one of the fields, say (pi, gains mass. The new Lagrangian will then be 

C = -^d^<P.d,<P'-^m'<Pl 

So this Lagrangian is no longer invariant under the full 5*0(3) group, which mixes any 
two of the three fields. Rather, it is only invariant under rotations in field space that mix 
02 and 03 or 5*0(2). In other words, giving one field mass broke 50(3) to the smaller 

SO (2). 

As another simple example, if we started with five massless complex scalar fields 0j, with 
Lagrangian 

C = -\d^cp%cp^ 
This will be invariant under any SU (5) transformation. 

Then let's say we give two of the fields, <pi and 02 (equal) mass. The new Lagrangian will 
be 

C = -]^d^cP%cP' - ^m(0l0i + 0|02) 
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So now, we no longer have the full SU{5) symmetry, but we do have the special unitary 
transformations mixing 03, 04, and 05. This is an SU (3) subgroup. Furthermore, we can 
do a special unitary transformation mixing 0i and 02- This is an SU (2) subgroup. So, we 
have broken SU{5) SU{3) SU{2). 

Before considering a more complicated example of this, we further discuss the connection 
between symmetry breaking and fields gaining mass. 



When we introduced spontaneous symmetry breaking in section 3.3.1 recall that we 
shifted the potential minimum from Vminimum at = to Vminimum at |0| = $. But we 
were discussing this in very classical language. We can interpret all of this in a more 
"quantum" way in terms of VEV's. As we said, the vacuum of a theory is defined as the 
minimum potential field configuration. For the Vminimum at = potential, the VEV of 
the field was at 0, or 



= 

However, for the Vminimum at |0| = $ potential, we have 

(O|0|O) = $ 

So, in quantum mechanical language, symmetry breaking occurs when a field, or some 
components of a field, take on a non-zero VEV. 

This seems to be what is happening in nature. At higher energies, there is some "Mas- 
ter Theory" with some gauge group defining the physics, and all of the fields involved 
have VEV's. At lower energies, for whatever reason (the reason for this is not well un- 
derstood at the time of this writing), some of the fields take on non-zero VEV's, which 
break the symmetry into smaller groups, giving mass to certain fields through the Higgs 



Mechanism discussed in section 3.3.2 We call the theory with the unbroken gauge sym- 



metry at higher energies the more fundamental theory (analogous to equation (3.43 )), and 



the Lagrangian which results from breaking the symmetry (analogous to (3.45 1) the Low 
Energy Effective Theory. 

And this is how mass is introduced into the Standard Model. It turns out that if a theory 
is renormalizable one can prove that any lower energy effective theory that results from 
breaking the original theory's symmetry is also renormalizable, even if it doesn't appear 
to be. And, because the actions that appear to describe the universe at the energy level 
we live at (and the levels attainable by current experiment) are not renormalizable when 
they have mass terms, we work with a larger theory which has no massive particles but 
can be renormalized, and use the Higgs Mechanism to give various particles mass. So, 
whereas the physics we see at low energies may not appear renormalizable, if we can find 
a renormalizable theory which breaks down to our physics, we are safe. 

Now, we consider a slightly more complicated (and realistic) example of symmetry break- 
ing. 
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3.3.7 A More Complicated Example of Symmetry Breaking 



Consider the gauge group SU {N), acting on complex scalar fields (pi {i = 1, . . . , N) in 



the fundamental representation N. Recall that in section 3.3.2 in order to get equation 
( 3.46[ >, we made use of the U (1) symmetry to make the vacuum, or the VEV, real. We can 
now do something similar: we make use of the SU (N) to not only make the VEV real, but 
also to rotate it to a single component of the field, (p^. In other words, we do an SU{N) 
rotation so that 



(O|0i|O) 
{O|07v|O) 



for 



i = l N -1 



i = 1, 



So, we expand 0Ar around this new vacuum: 

= (pi for 

0JV = $ + X 

This means that, in the vacuum configuration, the fields will have the form 

/0i\ 

h 







\<PnJ 



vacuum 



So, how will the action of SU{N) be affected by this VEV? If we consider a general element 
of SU{N) acting on this. 



U2I U22 



\Uni U 



N2 



Uin\ 
U2N 



UnnJ 



/0\ 



U2N 



\UnnJ 



So, only elements of SU (N) with non-zero elements in the last column will be affected by 
this VEV. But the other — 1 elements' rows and columns are unaffected. This means 
that we have an SU{N — 1) symmetry left. Or in other words, we have broken SU (N) — > 
SU{N - 1) with this VEV. 

Let's consider a specific example of this. Consider SU (3). The generators are written out 
in (2.461. Notice that exactly three of them have all zeros in the last column; A^, A^, and 
A^. We expect these three to give an SU{3 — 1) = SU{2) subgroup. And looking at the 
upper left 2x2 boxes in those three generators, we can see that they are the Pauli matrices, 
the generators of SU (2). So, if we give a non-zero VEV to the fields transforming under 
SU (3), we see that they do indeed break the SU (3) to SU (2). The other five generators of 
SU (3) will be affected by the VEV, and consequently the corresponding fields will acquire 
mass. 
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3.4 Particle Physics 



3.4.1 Introduction to the Standard Model 

We are finally ready to study the Standard Model of Particle Physics, which (except for 
gravity) appears to be the theory which explains our universe. To state the Standard 
Model in the simplest possible terms, it is 



A Yang-Mills (Gauge) Theory with Gauge Group 
SU{3)0 SU{2)®U{1) 
with left-handed Weyl fields fields in three copies of the representation 
(1, 2, -1/2) © (1, 1, 1) © (3, 2, 1/6) © (3, 1, -2, 3) © (3, 1, 1/3) 
(where the last entry specifies the value of the U{1) hypercharge), 
and a single copy of a complex scalar field in the representation 

(1,2,-1/2) 



Admittedly, our exposition will be somewhat cursory This is largely because every con- 
cept and tool we use in this section has been discussed in detail in the previous sections. 
The purpose of these notes is to provide an introduction to the primary concepts and 
mathematical tools used in Particle Physics, not to give the details of the theory. We will 
cover the main points of the Standard Model, but there is tremendous detail we are skip- 
ping over. A second reason the following section is cursory is that we will not be working 
out every step in detail, as we have been doing. For nearly all calculations being done in 
this section, we have worked out a similar tedious calculation previously. We will there- 
fore frequently refer to previous sections/ equations. It will be worthwhile to go back and 
carefully study the parts which we refer to. 

Because this section is slightly more experimental, or at least phenomenological, than the 
rest, and because the general purpose of these notes is to develop the mathematical tools 
and framework of particle physics (especially gauge theory), undue attention should not 
be given to this section. The purpose is merely to show, as briefly as possible, where 
everything we have done so far lines up with experiment. It will be useful to read through 
this section, but do not spend too much time bogged down in the details. 

Before diving into this in detail, look over the general structure of the Standard Model on 
page [139 
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3.4.2 The Gauge and Higgs Sector 



We begin our exposition with the Electroweak part of the Standard Model gauge group, 
the SU (2) (g) U{1) part, as well as the Higgs. 

Beginning with the Higgs, a scalar field in the (2, —1/2) representation of S'f/(2)®f/(l), the 



first step is to write down the covariant derivative as in (3.51 ). We denote the generators 





l a 




2 






^0 





as A^. The generator of U{l)isY = C where C is the hypercharge (—1/2 in this 

case), and the f/(l) gauge field is B^. So, the covariant derivative is 

{D^4>), = d,<P, - i[g2AlT^ + g^B^Y],,<p^ (3.56) 

where gi and g2 are coupling constants for the U (1) part and the SU (2) part, respectively. 
If the reason we wrote it down this way isn't clear, compare this expression to equation 



(3.51 1, and remember that we are saying the field carries two charges; one for SU (2) and 
one for U{1). Therefore, it has two terms in its covariant derivative. And, as usual, ;U is a 
spacetime index. 

Knowing that the generators of SU (2) are the Pauli matrices, we can expand the second 
part of the covariant derivative in matrix form. 



g2Al-g,B^ g2{Ay tA^) 
2 \g2iAl + tAl) -g^AygiB, 

So, the full covariant derivative is 

^^'^'^^^ \D,<P2) \d,h + '-f{Al + iAl)ct^,-l{g2Al + g,B,)ct^2) ^"'^'^ 

Now, we know that the Lagrangian will have the kinetic term and some potential: 

C = -]^D,4d>^<P,-V{4>\cP) (3.58) 



Let's assume that the potential has a similar form as equation ( 3.44[ > (we add the factors 



of one-half here for the sake of convention; they don't amount to anything other than a 
rescaling of A and $), 

V{cl>\ct>) = \\{cl>^-U'^ (3.59) 
Clearly the minimum field configuration is not at = 0, but at |0| = So, following 



what we did in section 3.3.7 we make a global SU{2) transformation to put the entire 
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VEV on the first component of (p, and then make a global U{1) transformation to make the 
field real. So, 



1 fv 



(3.60) 



and we expand (p around this new vacuum: 

1 fv + h{xy 
^^'^=V=2[ 

Remember that we have chosen our SU{2) to keep the second component and our U{1) 
to keep the first component real. So, h{x) is a real scalar field. 



(3.61) 



Clearly, plugging this into the covariant derivative ( 3.57[ > will give the exact same expres- 
sion as before, but with 0i replaced by -^h{x) and 02 replaced by 0, plus an extra term 
for V. When we plug this extra term into the kinetic term in the Lagrangian (3.58 1, we get 
that it is 



(1 0) 



g^A'^ - giB, 

g2{Al + iAl -g^Al-g^Bj \g2{A^^ + iA^^^) -g,A^>^ - g,B^ 



(3.62) 



Before multiplying this out, we employ a trick. Define the Weak Mixing Angle 

,-1 l'9i 



9w = tan 

and the shorthand notation 

s,,, = sin 6,,, and 
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c„, = cos6'„ 



And finally, we can define four new gauge fields as linear combinations of the four we 
have been using: 



w: 



A, 



CiiiA, 



These can easily be inverted to give the old fields in terms of the new fields. 



- 




42 - 




43 - 




B, = 
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(3.63) 

(3.64) 

(3.65) 
(3.66) 

(3.67) 

(3.68) 

(3.69) 
(3.70) 



We make a few observations about these fields before moving on. First of all, they are 



merely linear combinations of the gauge fields introduced in equation (3.56). Second, 
notice that the two fields are both linear combinations of fields corresponding to non- 
Cartan generators of SU{2), whereas and are both linear combinations of fields 
corresponding to Cartan generators of SU{2) and U{1). So, according to our discussion in 
section 2.2.16 we expect that and will interact but not change the charge, and that 
will interact and change the charge. Incidentally, notice that W^j" has the exact form 



of the raising and lowering operators defined in ( 2.18[ >. 



With these fields defined, we can now rewrite (3.62) as 



(the -k is there because that matrix element will always be multiplied by 0, so we don't 
bother writing it), where we have defined 

M^ = — and Mz = = — 

So, we see that, by symmetry breaking, we have given mass to the W^, the W~, and the 
Zf^ fields. However, the has not gained mass. 

These particles are the W and Z vector bosons, which are the force carrying particles of 
the Weak Force. Each of these particles has an extremely large mass {Mw Ri 80.4 GeV, 
and Mz ~ 91.2 GeV), which explains why they only act over a very short range (~ 10^^^ 
meters). 

Also note that the remains massless, implying that it did not acquire a VEV, and be- 
cause it is a single generator, we see that a single U{1) remains unbroken. This U{1) and 



A^ are the gauge group and field of Electromagnetism, as discussed in section 3.1.11 



The idea of all of this is that at very high energies (above the breaking of the SU {2)0U{1)), 
we have only a Higgs complex scalar field, along with four identical massless vector 
boson gauge fields (Aj^, A'j^, A^^, B^), each of which behave basically like a photon. At 
low energies, however, the SU{2) ® U{1) symmetry of the Higgs is broken, and the low 
energy effective theory consists of a linear combination of the original four fields. Three 
of those linear combinations have gained mass, and one remains massless, retaining the 
photon-like properties from before symmetry breaking. The theory above the symmetry 
breaking scale is called the Electroweak Theory (with four photon-like force carrying 
particles), whereas below the breaking scale they become two separate forces; the Weak 
and the Electromagnetic. This is the first and most basic example of unification we have 
in our universe. At low energies, the electromagnetic and weak forces are separate. At 
high energies, they unify into a single theory that is described by SU{2) ® U (1). 

We can express the new fields as simple Euler rotations of the old fields: 

A J \Al sin + B, cos 6^) ^ \aJ '^^''^> \B^ 
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So, the is a massive linear combination of the and 5^, while the photon is a 
massless linear combination of the two. 

We can do the same type of analysis for the W^, where they are both massive linear 
combinations of and A^. The and are both made up of a mixture of the SU{2) 
and U (1) gauge groups, whereas the W^^ come solely from the SU (2) part. 

Before moving on to include leptons (and then hadrons), we first write out the full La- 
grangian for the effective field theory for h{x) and the gauge fields. 

We start with the complete Lagrangian term for h{x). We have written the original field 
as in equation (3.61 ). So, our potential in equation ( 3.59[ > is now 



y(0t,0) = -A(0V 



2\2 



4 4 16 



The first term on the right hand side is clearly a mass term giving the mass of the Higgs 
(= \J\v), and the second two terms are interaction vertices. The kinetic term for the 
Higgs will be the usual — \d^hd'^h. 



Now, following loosely what we did in section 3.1.11 we want to find kinetic terms for 
the gauge fields. We start by finding them for the original gauge fields before symmetry 
breaking (A^,^^,^^ and B^). Using \3.55), (|3.48|), and (|3.50|), and the SU{2) structure 



constants given in equation (2.171, we have 

FL = 2Tr(F^,ri) 



2Tr {{d^A,-d,A,-ig^[A,,A,])T^) 

2 Tr ((a^A^T° - d^A^T'' - ig2AlAl[T\ T'])T^) 

2 Tr {d^AlT^T^ - d^A^^T^T^ - ig2AlAlif''^'^rT^) 

d^Al5''^ - d^A^d'^^ + g^A^Alr'^d''^ 

d,Al - d,A\ + g^A^Alr^' 

dX - d.A\ + g^A^AY"^^ 

d.Al-d^Ai + gMlAl-AlAl) 



And similarly. 



III/ 



d.Al + g^iAlAl-AlAl 
d,Al-d,Ai + g2{A\Al-AlAl 



And the gauge field corresponding to the U{\) will be defined as in ( 3.20[ ): 



B 



duB, 



So, we can now write the kinetic term for our fields according to equation (3.54 1: 

'-'Km ^ a ^ M'^ 
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We can then use (3.67-3.70) to translate these kinetic terms into the new fields. We will 
spare the extremely tedious detail and skip right to the Lagrangian: 

+ie{F'"' + cot e^Z^'')W+W- 



1 f h\ ^ 

-{M^W+'-W- + -MlZ^Z,) [l + - j 

2 ^ 2 2 V S 

where we have chosen the following definitions: 

F^u = d^Ay — dyA^ (Electromagnetic Field Strength) 
Z^,y = d^Zy - dyZ^ (Kinetic term for Z^) 
F>^, = d^- ie{A^ + cot d^Z^) 

and the rest of the terms were defined previously in this section. 



3.4.3 The Lepton Sector 

We now turn to the lepton sector (which is still in the SU {2) ®U{1) part of the Standard 
Model gauge group). A Lepton is a spin- 1/2 particle that does not interact with the SU (3) 
color group (the strong force). There are six Flavors of leptons arranged into three Fami- 



lies, or Generations. The table on page 139 explains this. The first generation consists of 



the electron (e) and the electron neutrino {ue), the second generation the muon (/i) and the 
muon neutrino (z/^J, and the third the tau (r) and tau neutrino {ur)- Each family behaves 
exactly the same way, so we will only discuss one generation in this section (e and z/g). To 
incorporate the physics of the other families, merely change the e to either a or r, and 
the Ue to a or Vr in the following notes. 

What we will see is that, in a sense, the neutrinos don't really interact with anything on 
their own (which is why they are incredibly difficult to detect). For this reason, neutrinos 



don't have their own place in a representation of SU (3) ® SU (2) ®U{1) (see table on 139 1. 
Electrons on the other hand, do interact with other things on their own, and we therefore 
see them in the (1,1) representation. 

However, the neutrino does interact with other things as fart of an SU (2) doublet with the 
electron, 

/ = h\ (3.71) 
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This is why it is arranged as it is on page 139 with the electron under the (2, —1/2) repre- 
sentation of 5f/(2) O f/(l). 



This may seem confusing, but we hope the following will make it clear. We will proceed 
in what we believe is the clearest way to see this (primarily following [32]). We start with 



2 fields, e and /, where e is a single left-handed Weyl field (see section 3.2.41, and / is 
defined in (3.71 1. As we have said, / is in the (2, —1/2) representation, e is in the (1, 1) 
representation, and Ue has no representation of its own. 



So, mimicking what we did in equation (3.56 1 in the previous section, we can write down 
the covariant derivative for each field. 



(3.72) 
(3.73) 



The field e has no SU{2) term in its covariant derivative because the 1 representation of 
SU{2) is the trivial representation - this means it doesn't carry SU{2) charge. Also, we 
know that 



Y, = -- 



1 fl 



2 VO 1 



(3.74) 



and 



= (1) 



1 
1 



(3.75) 



Following the Lagrangian for the spin-1/2 fields we wrote out in equation (3.11 ), we can 
write out the kinetic term for both (massless) fields: 



Kin 



(3.76) 



At the end of section 3.2.2 and of section 3.2.11 we briefly discussed the idea of renormal- 
ization. We said that certain theories can be renormalized and others cannot. It turns out 
(for reasons beyond the scope of these notes) that while the theory we have outlined so 
far is renormalizable, if we try to add mass terms for and / and e fields, the theory breaks 
down. Therefore we cannot add a mass term. But, we know experimentally that electrons 
and neutrinos have mass, so obviously something is wrong. We must incorporate mass 
into the theory, but in a more subtle way than merely adding a mass term. It turns out 
that we can use the Higgs mechanism as follows. 



While adding mass terms renders the theory inconsistent, we can add a Yukawa term (cf . 



equation (3.42 1), 



Yuk 



-ye' 



Le + h.c. 
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where y is another coupling constant, e*-' is the totally antisymmetric tensor, and h.c. is the 
Hermitian Conjugate of the first term. 



Now that we have added Cyuk to the Lagrangian, we want to break the symmetry exactly 
as we did in the previous section. First, we replace 0i with -^{v + h{x)) and 02 with 0, 
exactly as we did in equation (3.61 1. So, 



C 



Yuk 



IjC + h.c. 

-y{<Pik - 02^i)e + h.c. 

—y{v + h)l2e + h.c. 

V 2 



y{v + h)ee 



V2 



V2 



yiy + h)ee 



(3.77) 



where S 



is the Dirac field for the electron (e is the electron and is the anti- 



electron, or positron). Comparing (3.77) with ( 3.12[ >, we see that it is a mass term for the 
electron and positron. 

Now we want a kinetic term fo r the n eutrino. It is believed that neutrinos are described 
by Majorana fields (see section 



3.2.41, so we begin with the field N' = (^{^ . Now, 



we 



employ a trick. Referring back to equations (3.33 1 and ( 3.34[ >, the kinetic term for Majorana 



fields has only one term (because Majorana fields have only one Weyl spinor), whereas 
the Dirac field sums over both Weyl spinors composing it. So, instead of working with 
the Majorana field J\f', we can instead work with the Dirac field 



So, the Dirac kinetic term iM'^^d^^.M will clearly result in the correct kinetic term from 



(3.761, or iv'^a^d^v. 



Now, continuing with the symmetry breaking, we want to write the covariant derivative 



(3.721 and (3.731 in terms of our low energy gauge fields (3.63-3.661. 



We said in the previous section (which echoed our discussion in section 2.2.161 that the 
gauge fields corresponding to Cartan generators (A^ and act as force carrying parti- 
cles, but do not change the charge of the particles they interact with. On the other hand, 
the non-Cartan generators' gauge fields (VF^) are force carrying particles which do change 
the charge of the particle they interact with. Therefore, to make calculations simpler, we 
will break the covariant derivative up into the non-Cartan part and the Cartan part. 
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The non-Cartan part of the covariant derivative (3.72 1 is 

1 



-i 

1 



:92 





4 + < 
W+ 



92 f 

V2 \W- 



and the Cartan part is 



g2AlT' + g,B,Y 



= e{A^ + cot ^^Z^)r3 + e{A^ - tan e^Z;)Y 
= e{T^ + Y)A^ + e(cot O^T^ - tan 

We have noted before that A^ is the photon, or the electromagnetic field, and e is the 
electromagnetic charge. Therefore, the linear combination + F must be the generator 
of electric charge. Notice that the electromagnetic generator is in a linear combination of 
the two Cartan generators of SU (2) ®U{1). 



We know that = ^a^, and Yi and Yg are defined in equations (3.741 and (3.751, so we 
can write 



Yil 



1 
-1 

1 
1 



And we know that e carries no charge, so its eigenvalue is 0, while Fg is +1. So, 
summarizing all of this. 



TV, 



1 

2 ' 



1 



1 



Ye = — e 
2 



T^e = 
Ye = +e 

Then defining the generator of electric charge to be Q = + Y, we have 

Que = Qe = —e Qe = +e 

So the neutrino z/, has no electric charge, the electron e has negative electric charge, and 
the antielectron, or positron, has plus one electric charge — all exactly what we would 
expect. 

We can now take all of the terms we have discussed so far and write out a complete 
Lagrangian. However, doing so is both tedious and unnecessary for our purposes. 
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The primary idea is that electrons /positrons and neutrinos all interact with the SU (2) 
f/(l) gauge particles, the W^, Z^, and A^. The and (the Cartan gauge particles) 
interact but do not affect the charge. On the other hand, the act as SU{2) raising 
and lowering operators (as can easily be seen by comparing (3.63) and (3.641 to equation 
(2.181). The SU{2) doublet state acted on by these raising and lowering operators is the 
doublet in equation (3.71 1. The interacts with a left-handed electron, raising its elec- 
tric charge from minus one to zero, turning it into a neutrino. However does not 
interact with left-handed neutrinos. On the other hand, W~ will lower the electric charge 
of a neutrino, making it an electron. But W~ will not interact with an electron]^ 



3.4.4 The Quark Sector 



A Quark is a spin-1/2 particle that interacts with the SU (3) color force. Just as with lep- 
tons, there are six flavors of quarks, arranged in three families or generations (see the 
table on page 139[ >. 

Following very closely what we did with the leptons, we work with only one generation. 
Extending to the other generators is then trivial. To begin, define three fields: q, u, and J, 
in the representations (3, 2, +1/6), (3, 1, -2/3), and (3, 1, +1/2) of 5[/(3) ® SU{2) ® U{1). 
The field q will be the SU{2) doublet 



(3.78) 



This is exactly analogous to equation ( |3.71[ >. 

Again, following what we did with the leptons, we can write out the covariant derivative 

^i\--zn-npj -^ly^Bf^qai (3.79) 

^ B^u" (3.80) 
J" (3.81) 



for all three fields: 






iDf,q)ai 


9fj,qai 


- ig,Al{T^)iqp, 










= 





3 



where i is an SU{2) index and a is an SU{?>) index. The 5'f/(3) index is lowered for the 3 
representation and raised for the 3 representation. 

Just as with leptons, we cannot write down a mass term for these particles, but we can 
include a Yukawa term coupling these fields to the Higgs: 

Cvuk = -y'e'^(j)iqajd'" - y"(f)^'qaiu" + h.c. 



^This does not mean that no vertex in the Fe5rnman diagrams will include a W and an electron field, 
but rather that if you collide an electron and a W~, there will be no interaction 
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As with the leptons, we can break the symmetry according to equation (3.61 1, and writing 
out this Yukawa term, we get 



Yuk 



v 2 V 2 



where we have defined the Dirac fields for the up and down quarks: 



Notice that, whereas both the up and down quarks were massless before breaking, they 
have now acquired masses 



y'v y"v 



Writing out the non-Cartan and Cartan parts of the covariant derivatives in terms of the 
lower energy SU (2) (g) U (1) gauge fields, we get 



2 rjil 
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w: 



g^AlT' + g.B.Y 



\W- 
eQA, + —{r^-slQ)Z, 



And it is again straightforward to find the electric charge eigenvalue for each field: 



2 1 _ 

Qu = +-U Qd = —-d Qu 



Qd = +-d 



Again, we can collect all of these terms and write out a complete Lagrangian. But, doing 
so is extremely tedious and unnecessary for our purposes. 



The primary idea to take away is that the SU{2) doublet (3.78 1 behaves exactly as the lep 



ton doublet in (3.71 ) when interacting with the "raising" and "lowering" gauge particles 
W^. This is why the u and d are arranged in the SU (2) doublet q in (3.781, and why q 
carries the SU (2) index i in the covariant derivative ( 3.79[ >, whereas u and d carry only the 
SU{3) index. 

The SU (3) index runs from 1 to 3, and the 3 values are conventionally denoted red, green, 
and blue (r, g, b). These obviously are merely labels and have nothing to do with the colors 
in the visible spectrum. 
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The eight gauge fields associated with the eight SU (3) generators are called Gluons, and 
they are represented by the matrices in (2.48). We label each gluon as follows: 



(rr rg rb^ 
gf gg gb 
bf bg bb^ 

so that the upper index is the anti-color index, and denotes the column of the matrix. 



and the lower index is the color index denoting the row of the matrix. Then, from (2.481, 
consider the gluon 

'0 1 0^ 
oc I 



.0 0, 



and the quarks 



It is easy to see that this gluon will interact as 

g^Qr =0 9fqg = Qr 9r(lb = 

Or in other words, the gluon with the anti-green index will only interact with a green 
quark. There will be no interaction with the other quarks. Multiplying this out, and 
looking more closely at the behavior of the SU{3) generators and eigentstates as discussed 
in sections 2.2.12 -2.2.15 you can work out all of the interaction rules between quarks and 



gluons. You will see that they behave exactly according to the root space of SU (3). 
3.5 References and Further Reading 

The primary source for these notes is 1132 L which is an exceptionally clear introduction to 
Quantum Field Theory. We also used a great deal of meterial from ||3l, ||24|. |l29J, and [35], 
all of which are outstanding QFT texts. The derivation of the Dirac equation came from 
pT|, which is written mostly above the scope of these notes, but is an excellent survey of 
some of the mathematical ideas of Non-Perturbative QFT and Gauge Theory. 

The sections on the Standard Model come almost entirely from [62| with little change, in 
that Srednicki's exposition could hardly be improved upon for the scope of these notes. 



For further reading, we also recommend lH, lUIl, lEH, EH, and lEZB. 
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4 The Standard Model — A Summary 



4.1 How Does All of This Relate to Real Life? 



In the fifth century B.C., a Greek named Empedocles took the ideas of several others 
before him and combined them to say that matter is made up of earth, wind, fire, and 
water, and that there are two forces. Love and Strife, that govern the way they grow and 
act. More scientifically, he was saying that matter is made of smaller substances that 
interact with each other through repulsion and attraction. Democritus, a contemporary 
of Empedocles, went a step further to say that all matter is made of fundamental particles 
that are indestructible. He called these particles atoms, meaning "indivisible']^ |[T4|. 

The field of particle physics seeks to continue studying these same concepts. Are there 
fundamental, indivisible particles and if so, what are they? How do they behave? How 
do they group together to form the matter that we see? How do they interact with each 
other? 

The current answer to these questions is called the Standard Model, the theory we spent 
this paper developing. We have now spent more than one hundred pages expositing a se- 
ries of mathematical tricks for various types of "fields". In doing so, we talked about 
"massless scalars with f/(l) charge", and about things "in a j = ^ representation of 
SU{2)". But one could easily be left wondering how exactly this relates to the things 
we see in nature. We only discussed 25 particles in the previous section and in the table 



on page 139 (particles and antiparticles), but you are likely aware that there are hundreds 
of particles in nature. What about those? How does the mathematical framework detailed 
so far form the building blocks for the universe? 

While we wish to reiterate that the primary purpose of these notes is to provide the math- 
ematical tools with which particle physics is done, and not to outline the phenomenolog- 
ical details of the theory, we are physicists still — not mathematicians. Therefore, before 
concluding this paper, we will take a brief hiatus from the mathematical rigor and look at 
a qualitative summary of particle physics. 

Throughout this section, the footnotes will provide brief explanations of the analogous 
mathematical ideas from above. This sectiorijcan be read with or without paying atten- 
tion to them. We provide them merely for those curious. 



^Of course, our modem use of the word is different. At their discovery, it was thought that different 
elements were the indivisible particles sought for, so the name atom seemed appropri ate 



5 



Nearly everything in this section is adapted from jH, including the tables on page 134 
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4.2 The Fundamental Forces 



The two forces most familiar to people are Gravity and Electromagnetism. Just the act 
of standing on the ground or sitting in a chair makes use of both, and every "Physics I" 
student has drawn a free body diagram with a gravitational force going down and a 
normal force (caused by the electromagnetic repulsion between the two objects) going 
up. However, these two are only half of the four fundamental forces in our universe (that 
we know of). 

We can think about the third by first considering a compact nucleus which we know to be 
made of protons and neutrons. From electromagnetism we know that the protons should 
repel each other because of their like charge. But the nuclei of atoms somehow hold to- 
gether, which is evidence for some stronger force that causes these particles to attract. 
This force, which overcomes the electromagnetic repulsions and allows atomic nuclei to 
remain stable, is called the Strong forcej^ Just as electrically-charged particles are sub- 
ject to the electromagnetic force, some particles have a property similar to charge, called 
Color, and are subject to the strong force. The field theory that describes this is called 
Quantum Chromodynamics (QCD)[^ and was first proposed in 1965 by Han, Nambu, 
and Greenberg [20] . This theory predicts the existence of the gluon, which is the mediator 
of the strong force between two matter particles. 

The fourth force is the one we have the least familiarity with. It is responsible for certain 
types of radioactive decays; for example, permitting a proton to turn into a neutron and 
vice versa. It is called the Weak force jj 

In the 1960's, Sheldon Glashow, Abdus Salum, and Steven Wetnburg independently de- 
veloped a gauge-invariant theory that unified the electromagnetic and weak force [20]. 
At sufficiently high energies it is observed that the difference between these two separate 
forces is negligible and that they instead act together as the Electroweak forcej^ For pro- 
cesses at lower energy scales, the symmetry between the electromagnetic and the weak 
force is broken and we observe two different forces with different properties. Similar 
to QCD, electroweak theory predicts four force-carrier particle^ that mediate the force 
between matter particles. The mediating particle for electromagnetism is the neutral pho- 
ton, and those for the weak force are the (with +1 electron charge), W~ (with — 1 
electron charge) and Z° (neutral) bosons. 

The electromagnetic, weak, and strong forces forces described above form what is called 
the Standard Model of Particle Physics. The Standard Model is an incomplete theory 
in the sense that it fails to describe gravitation, the force that acts on matter. Physicists 

^The SU{3) color force 

''Again, the study of the SU (3) color force 

**The SU(2) part that is left over when SU(2) (g) 17(1) is broken 

''The unbroken SU{2) (g) C/(l) force 

^''Corresponding to the 3 generators of SU{2) and the 1 in U{1). Two of them are Cartan and are, there- 
fore, uncharged, while two are non-Cartan and therefore carry charge 
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continue to work towards a theory that describes all four fundamental forces, with String 
Theory currently the most promising. The papers later in this series will discuss these 
ideas. For the rest of the sections in this review, however, it should be assumed that we are 
talking about physics under the Standard Model onl}p] which, despite the shortcoming 
of not explaining gravity, has tremendous experimental support. 



4.3 Categorizing Particles 



In the last century, experimenters were surprised as they discovered new particle after 
new particle. It seemed disorganized and overwhelming that there could be so many 
elementary objects. Eventually, however, the properties of these particles became better 
understood and it was found that there really is just a small, finite set of fundamental 
particles, some of which can be grouped together to make up larger objects. In the next 
two sections, we will introduce the elementary particles and then will discuss the types 
of composite particles. 

One property of the "zoo" of discovered particles that helps in our organizing them is 
their intrinsic spinj^ Any particle, elementary or composite, that is of half-integer spirpj 
is a Fermion. Those with integer spin are Bosons The spins govern the statistics of a set 
of such particles, so fermions and bosons may also be defined according to the statistics 
they obey. 

Namely, fermions obey Fermi-Dirac statistics and therefore also obey the Pauli Exclusion 
Principle. This means that no two identical fermions can be found in the same quantum 
state at the same time. Furthermore, to accurately display this behavior it is found that 
the wave function of a system with fermions must be antisymmetric; swapping any two 
like fermions causes a change in sign of the overall wave function. 

Bosons on the other hand obey Bose-Einstein statistics; any number of the same type of 
particle can be in the same state at the same time. In contrast to fermions, the wavefunc- 
tion of a system of bosons is symmetric p| 



The Venn diagrams on page 138 the table provided on page 139 and the table below 



should be referenced as you read through what follows. 



^^We are not assuming Supersymmetry in this paper, though we will consider Supersymmetry in a later 



paper 

^^Or in other words, which representation of SU{2) they sit in 
^Is in the j = ^ or j = | representation of SU{2) 
^^Is in the j = or j = 1 representation of SU{2) 



13t 



i^Cf. section 3.2.3 



129 



4.4 Elementary Particles 



The elementary particles are those that are considered fundamental, or in other words, 
are not composed of smaller particlesj^ They can be divided into two groups: matter 
particles and non-matter particles. 

The elementary matter particles all have half-integer spin (so are fermions) and the ele- 
mentary non-matter particles all have integer spin (so are bosons). We can then observe 
that an equivalent grouping is made if we divide the elementary particles instead by their 
intrinsic spin, which is commonly done. Then an elementary matter particle is the same 
thing as an elementary fermion, and similarly for the bosons. The two terms are used 
interchangeably in the discussion below. 



4.4.1 Elementary Fermions 



The elementary fermions are the building blocks of all other matter. For example, the 
proton and neutron are made up of different combinations of three elementary quarks. 
Electrons, which are also elementary, cloud around the protons and neutrons, and when 
all three group together in a particular way, an atom is formed. Less familiar examples 
include those that are unstable, such as the muon, which decay into something else fairly 
quickly. 

For every elementary (and sometimes composite) matter particle, there is also a corre- 
sponding particle with the same mass but of different charge and magnetic momentj^ 
Generally the name of such a particle is the same as the corresponding "normal" matter 
particle, but with the prefix "anti" in front of it (e.g. antiquark, antilepton, etc.). In this 
paper, whenever we discuss matter and its properties, it is implied that the antimatter 
counterparts have similar properties. 

Now we further divide the elementary fermions into two groups, quarks and leptons. A 
convenient way to distinguish these two sets is by whether or not they interact via the 
strong force: quarks may interact via the strong force, while leptons do not. 



Qmrks 



Experiments involving high energy collisions of electrons and protons led Murray Gell- 
Mann to suggest in 1964 BZSl that protons and neutrons are actually composite particles, 
made of three point-like, spin-1/2 particles whose charges are either —1/3 or +2/3 units 
of electron charge. He called these particles Quarks. Through further experiments it has 
been found that there are six flavors of quarks total, grouped into three generations with 



^^These are the ones that are in some represe ntati on of SU{3) SU{2) (g) U{1) on the table on page 
^''Cf. material on spin-1/2 particles in section 3.1 
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the first generation containing the up and down quarks, the second generation containing 
the more massive charm and strange quarks, and the third generation containing the even 
more massive top and bottom quarks. 

As electrically charged particles are subject to the electromagnetic force, quarks have a 
property similar to charge, called color, and any colored particle is subject to the strong 
force. It is found that there are three different types of colors: (defined as) red, green, 
and blue (plus three more for antiquarks: antired, antigreen, and antiblue). Quarks are 
grouped together to make composite particles that are colorless (the color charges cancel 
out), which is why the concept of color was only discovered after quarks themselves were 
found. The addition of color to the quark model also ensures that any quarks contained 
in a composite particle will not violate the exclusion principle since each has a different 
color. Again, QCD is the field theory that describes these properties. 

Another interesting feature of quarks is that they are never found alone, but rather always 
inside of a composite particle. This phenomenon is called Confinement]^ It is more 
a property of the strong force, which increases in strength as two colored particles are 
pulled away from each other, just as would happen when the ends of a piece of elastic are 
pulled apart. We can consider reaching a distance between the two quarks where there 
is sufficient potential energy built up that it can be converted to matter, creating a quark- 
antiquark pair. The pair will separate and the resulting particles will recombtne with 
the original quarks. As this process repeats, and more quark-antiquark pairs are created, 
the end result in the whole process is a multiplication of the number of quarks and of the 
number of composite particles. In the opposite extreme, as two quarks get closer together, 
the strong force between them becomes weaker until the quarks move around freely and 
more independently This is a called Asymptotic Freedom. 

Quarks also interact with other particles via the weak force, which is the only force that 
can cause a change of flavor (changing an up into a down, for example). When this 
happens, a quark either turns into a heavier quark by absorbing a W boson, or it emits a 
W boson and then decays to a lighter quark. Beta decay, a common radioactive process, 
is caused by this mechanism. Instead of just thinking of beta decay as a neutron in the 
nucleus of an atom decaying, or splitting, into a proton, electron, and antineutrtno, we 
can go a step further with our understanding of quarks subject to the weak force. We add 
that, really, it is one of the down quarks in the neutron that emits a boson and then 
decays to the lighter up quark, keeping charge conserved in the process]^ The neutron, 
which used to have one up and two down quarks, now has one down and two up quarks, 
which is the composition of a proton. The electron and antineutrino are created from the 
decay of the boson. 



^^We did not discuss confinement in the main body of this paper, though it can be derived from what we 
did discuss 

^^Other conserved quantities are momentum, energy, quark number, lepton number, and (approxi- 
mately) lepton generation number 
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Leptons 



Leptons interact with other matter via the electromagnetic, the weak, and gravitational 
forces, but not through the strong force]^ There are three charged leptons, grouped, like 
the quarks, into three different generations based on their massesP] The electron is the 
lightest of the charged leptons, then the muon, and the tau. There are also three neutral 
leptons, called neutrinos ('Tittle neutral one"), one type for each of the charged leptons: 
the electron neutrino, the muon neutrino, and the tau neutrino. 

Some quantities in lepton events are found to be conserved]^ If we define lepton num- 
ber as the number of leptons minus the number of antileptons, then lepton number is 
constant in all interactions. Additionally, the lepton number within each generation is also 
approximately conserved. For example, the number of electrons and electron neutrinos 
minus the number of antielectrons and electron antineutrinos is found to be constant in 
most particle reactions. 

An interesting exception is in neutrino oscillations, where a neutrino changes lepton fla- 
vors as it travels. For example, we can take a measurement and observe an electron 
neutrino, even though it was known to have been created as a muon neutrino. These 
oscillations of flavor only occur if neutrinos have mass (even just very small mass), so the 
fact that the Standard Model currently predicts them to be massless demonstrates that 
there are some parameters in the theory that need to be adjusted. 



4.4.2 Elementary Bosons 



Throughout the development of the Standard Model it was found that some elementary 
particles play a different role than the ordinary matter particles that make up the stuff of 
the universe. Both the gauge bosons and the Higgs boson fall into this group. 



Gauge Bosons 



In the mathematical formulation of quantum field theory, the Lagrangian can be made 
invariant under a local gauge transformation by the addition of a vector field called a 
gauge field. As with the more familiar example of an electron, the quanta of the gauge 
field is a type of particle, which in this case is called a Gauge Boson. There are three 
types of gauge bosons described by the Standard Modelp] They are the photon, which 



^''This means that leptons carry SU{2) C/(l) terms in their covariant derivatives, but not SU (3) terms 
^^This is equivalent to the statement above that there are three copies of the Standard Model Gauge Group 



- Cf. page 115 

^^These conservation laws can all be derived from the rules we discussed above, though they are typically 
treated separately because they are extremely useful when talking about specific interactions 

^■'This is equivalent to saying that there are three gauge groups, each with their own set of generators 
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carries the electromagnetic force, the W and Z bosons, which carry the weak force, and 
the gluons which carry the strong force. Each of these bosons have been experimentally 
detected. 

Evidence for the neutral photon first came in 1905 when Einstein proposed an explanation 
of the photoelectric effect, that light was quantized into energy packets [7J. Confirmation 
of the , and Z'^ bosons came in 1983 through proton-proton collisions at the Eu- 

ropean Organization for Nuclear Research (CERN) [5J. 

The gluons were first experimentally observed in 1979 in the electron-position collider 
at the German Electron Synchroton (DESY) in Hamburg f5\. Further experiments have 
demonstrated that the gluons have eight different color states and that, because they in- 
teract via the strong force, they have properties similar to quarks, such as confinement. 

Taking into account their possible charge or color, we find that there are 12 gauge bosons 
in all, one for the electromagnetic force, three for the weak force, and eight for the strong 
force. 

The Higgs Boson 

The Higgs boson is the only Standard Model particle that has not yet been observed. It is 
also the only elementary boson that is not a gauge boson. Rather, it is the carrier particle 
of the scalar Higgs field from which other particles acquire mass. The existence of the 
Higgs would explain why some particles have mass and others do not. For example, 
the W and Z bosons are very massive, whereas the photon is massless. One of the main 
goals of the Large Hadron Collider (LHC), located at CERN in Switzerland, is to provide 
evidence for the Higgs. It is expected to be in full operation in 2009. 

4.5 Composite Particles 

Examples of composite particles include hadrons, nuclei, atoms, and molecules. The latter 
three are well known and will not be described here. 

Hadrons are made up of bound quarks and interact via the strong force. They can be 
either fermions or bosons, depending on the number of quarks that make them up. An 
odd number of bound quarks create a spin- 1/2 or spin-3/2 hadron, which is called a 
baryon, and an even number of quarks create spin-0 or sptn-1 hadrons, called mesons. 
Experimentally, only combinations of three quarks or two quarks have been found, so the 
terms baryon and meson often just refer to three or two bound quarks, respectively. 

You can understand why mesons and baryons have the spin that they do by considering 
how many spin- 1/2 quarks compose them. A meson has two quarks, and therefore the 
total spin of a meson is the sum of an even number of half-integer spin particles, which 
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will be integer spin. And because there are only two of them, it is either spin or 1. 
Baryons, on the other hand, will have a linear combination of three particles with half- 
integer spin, which will of course be half -integer: 1/2 or 3/2. 

The most well known examples of baryons are protons and neutrons. Protons are made 
of two up quarks and one down quark, or \uud), and neutrons are made of two down 
and one up, or \udd). The baryons are made of "normal" quarks only and their antimatter 
counterparts are made of the corresponding antiquarks. 

The mesons are made of a quark and an antiquark pair, though not necessarily of the 
same generation. Examples include the 7r+ \ud) and \us). 

One of the reasons for the zoo of particles discovered in the past century is because of 
the numerous possible combinations of six quarks put into a three-quark or two-quark 
hadron. Additionally, each of these combinations can be in different quantum mechanical 
states, thereby displaying different properties. For example, a rho meson p has the same 
combination of quarks as a pion vr, but the p is spin-1 whereas the pion is spin-0. 

4.6 Visualizing It All 

Finally, we provide a few tables which should help you see all of this more clearly. 



Interactions 


Acts On 


Strength 


Range 


Strong 


Hadrons 


1 


10-15 ^ 


Electromagnetism 


Electric Charges 


10-2 


oo (l/r2) 


Weak 


Leptons and Hadrons 


10-5 


10-i^m 


Gravity 


Mass 


10-39 


oo (l/r2) 



where the relative strengths have been normalized to unity for the strong force. 
Also, the four classes of force-carrying gauge bosons are shown belowf^ 



Interaction 


Gauge Boson 


Spin 


Acts On 


Strong 


Gluon 


1 


Hadrons 


Electromagnetism 


Photon 


1 


Electric Charges 


Weak 


w^, Z-, 


1 


Leptons and Hadrons 


Gravity 


Graviton 


2 


Mass 



^*The graviton is a the h5qDothetical carrying particle for the gravitational force; it is not described by the 
Standard Model 
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5 A Look Ahead 



Now that we have completed our introduction to basic particle theory, we can begin our 
uphill climb towards more fundamental concepts. As a preview, notice that everything 
we have done so far has been an exposition of how gauge theories work. Our investiga- 
tion into gauge theories has been purely algebraic (working entirely from group theory, as 
Part II demonstrates). As gauge theory seems to be the correct approach to understand- 
ing our universe, everything we do for the remainder of this series will be focused on a 
more fundamental understanding of gauge theory, culminating in String Theory. 

As we just stated, we have been treating gauge theory as a purely algebraic construct. 
However String Theory, if true, must obviously be able to reproduce the same general 
framework we have seen so far. But, String Theory is fundamentally a geometric con- 
struct. As we will see. String Theory will reproduce literally everything we have seen 
about gauge theory, but from a geometric framework. 

This should not be entirely foreign, though. Recall that, for electromagnetism, the gauge 
group is U{1). We can "draw" this geometrically as a circle in the complex plane. The 
Weak force is represented by the gauge group SU (2), which we have seen is parame- 
terized by three numbers, and therefore has three generators. As we discussed in these 
notes, we should think of these spaces as vector spaces and the generators as basis vectors 
spanning the entire space. The same is true of SU{3), though it is an eight-dimensional 
space. So, because there is a space associated with each of these groups, it should be 
somewhat obvious that there is a natural geometric picture associated with a Lie group. 

While the idea of the parameter space of a Lie group having a geometric picture associ- 
ated with it may seem straightforward, the geometry undergirding gauge theory can be 
extremely complicated, and we therefore must spend a significant amount of time inves- 
tigating it. Therefore, the next paper in this series will be an introduction to the geometric 
structure of gauge theory. Just as we have built gauge theory from algebra, we will in a 
sense start over and rebuild it using geometry. However, because we have already cov- 
ered a great deal of detail in the physics and mathematics of gauge theory and particle 
physics in general, we will move much more quickly to avoid being repetitive. 

When we finally get to String Theory (later in this series), we will see that the geometric 
and algebraic pictures come together beautifully, and that a thorough understanding of 
both will be necessary to understand what may be the "ultimate" theory of our universe. 
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